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Preface 

The following notes introduce Quantum Mechanics at an advanced level addressing students of Physics, 
Mathematics, Chemistry and Electrical Engineering. The aim is to put mathematical concepts and tech- 
niques like the path integral, algebraic techniques, Lie algebras and representation theory at the readers 
disposal. For this purpose we attempt to motivate the various physical and mathematical concepts as well 
as provide detailed derivations and complete sample calculations. We have made every effort to include in 
the derivations all assumptions and all mathematical steps implied, avoiding omission of supposedly 'trivial' 
information. Much of the author's writing effort went into a web of cross references accompanying the mathe- 
matical derivations such that the intelligent and diligent reader should be able to follow the text with relative 
ease, in particular, also when mathematically difficult material is presented. In fact, the author's driving 
force has been his desire to pave the reader's way into territories unchartered previously in most introduc- 
tory textbooks, since few practitioners feel obliged to ease access to their field. Also the author embraced 
enthusiastically the potential of the Tj^X typesetting language to enhance the presentation of equations as 
to make the logical pattern behind the mathematics as transparent as possible. Any suggestion to improve 
the text in the respects mentioned are most welcome. It is obvious, that even though these notes attempt 
to serve the reader as much as was possible for the author, the main effort to follow the text and to master 
the material is left to the reader. 

The notes start out in Section 1 with a brief review of Classical Mechanics in the Lagrange formulation and 
build on this to introduce in Section 2 Quantum Mechanics in the closely related path integral formulation. In 
Section 3 the Schrodinger equation is derived and used as an alternative description of continuous quantum 
systems. Section 4 is devoted to a detailed presentation of the harmonic oscillator, introducing algebraic 
techniques and comparing their use with more conventional mathematical procedures. In Section 5 we 
introduce the presentation theory of the 3-dimensional rotation group and the group SU(2) presenting Lie 
algebra and Lie group techniques and applying the methods to the theory of angular momentum, of the spin 
of single particles and of angular momenta and spins of composite systems. In Section 6 we present the theory 
of many-boson and many-fermion systems in a formulation exploiting the algebra of the associated creation 
and annihilation operators. Section 7 provides an introduction to Relativistic Quantum Mechanics which 
builds on the representation theory of the Lorentz group and its complex relative Sl(2, C). This section makes 
a strong effort to introduce Lorentz-invariant field equations systematically, rather than relying mainly on 
a heuristic amalgam of Classical Special Relativity and Quantum Mechanics. 

The notes are in a stage of continuing development, various sections, e.g., on the scmiclassical approximation, 
on the Hilbcrt space structure of Quantum Mechanics, on scattering theory, on perturbation theory, on 
Stochastic Quantum Mechanics, and on the group theory of elementary particles will be added as well as 
the existing sections expanded. However, at the present stage the notes, for the topics covered, should be 
complete enough to serve the reader. 

The author would like to thank Markus van Almsick and Heichi Chan for help with these notes. The 
author is also indebted to his department and to his University; their motivated students and their inspiring 
atmosphere made teaching a worthwhile effort and a great pleasure. 

These notes were produced entirely on a Macintosh II computer using the TgX typesetting system, Textures, 
Mathcmatica and Adobe Illustrator. 



Klaus Schultcn 
University of Illinois at Urbana-Champaign 
August 1991 
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Chapter 1 

Lagrangian Mechanics 



Our introduction to Quantum Mechanics will be based on its correspondence to Classical Mechanics. 
For this purpose we will review the relevant concepts of Classical Mechanics. An important concept 
is that the equations of motion of Classical Mechanics can be based on a variational principle, 
namely, that along a path describing classical motion the action integral assumes a minimal value 
(Hamiltonian Principle of Least Action). 

1.1 Basics of Variational Calculus 

The derivation of the Principle of Least Action requires the tools of the calculus of variation which 

we will provide now. 

Definition: A functional S[ ] is a map 

S[] : T -> R ; T = {q(t); q: [t ,ti] C R ->■ R M ; q(t) differentiate} (1.1) 

from a space T of vector-valued functions q(t) onto the real numbers. q(t) is called the trajec- 
tory of a system of M degrees of freedom described by the configurational coordinates q(t) = 

(qi(t),q2(t), ■ ..q M (t))- 

In case of N classical particles holds M = 3N, i.e., there are 3N configurational coordinates, 
namely, the position coordinates of the particles in any kind of coordianate system, often in the 
Cartesian coordinate system. It is important to note at the outset that for the description of a 
classical system it will be necessary to provide information q(t) as well as ^.q(t). The latter is the 
velocity vector of the system. 

Definition: A functional S[ ] is differentiate, if for any q(t) £ T and Sq(t) £ !F t where 

= {Sq(t); Sq(t) e T, \m)\ < e, < e,Vt, t G [t ,h} C R} (1.2) 

a functional SS[ ■ , • ] exists with the properties 

(i) S[q(t) + 5q(t)} = S[q(t)} + 6S[ftt),W)] + 0(e 2 ) 

(ii) 8S[q(t),5q(t)] is linear in Sq(t). (1.3) 

8S[- , •] is called the differential of S[ ]. The linearity property above implies 

5S[q{t),a 1 8q 1 (t) + a 2 Sq 2 (t)} = ai 5S[q{t)^Qli.t)} + a 2 SS[q(t),Sq 2 (t)} . (1.4) 



1 



2 



Lagrangian Mechanics 



Note: Sq(t) describes small variations around the trajectory q(t), i.e. q(t) + Sq(t) is a 'slightly' 
different trajectory than q(t). We will later often assume that only variations of a trajectory q(t) 
are permitted for which Sq(to) = and Sq\ti) = holds, i.e., at the ends of the time interval of 
the trajectories the variations vanish. 

It is also important to appreciate that 5S[ ■ , • ] in conventional differential calculus does not corre- 
spond to a differentiated function, but rather to a differential of the function which is simply the 
differentiated function multiplied by the differential increment of the variable, e.g., df = j-dx or, 



in case of a function of M variables, df = Yl 



m aj_ 

i=l dxj 



dxj 



We will now consider a particular class of functionals S[ ] which are expressed through an integral 
over the the interval [io,ii] where the integrand is a function L(q(t), ^q(t),t) of the configuration 
vector q(t), the velocity vector 4zq(t) and time t. We focus on such functionals because they play 
a central role in the so-called action integrals of Classical Mechanics. 

In the following we will often use the notation for velocities and other time derivatives ^q{t) = q(t) 

and ^ = xj. 
Theorem: Let 



S[q(t)} 



ti 



dtL(q(t),q(t),t) 



(1.5) 



to 



where L{ ■,-,■) is a function differentiable in its three arguments. It holds 



5S[q(t),m)\ 




~dL 


L 


{ -)] 




dt 


\Si S )\ 



M 



dL 



i=i 



to 



(1.6) 



For a proof we can use conventional differential calculus since the functional (1.6) is expressed in 
terms of 'normal' functions. We attempt to evaluate 



S[q(t) + Sq(t)\ 



I' dtL(q(t) +5q(t),m +Sffl,t) 
Jt 



(1.7) 



through Taylor expansion and identification of terms linear in 5qj(t), equating these terms with 
5S[q(t), 5q(t)]. For this purpose we consider 



M 



L(q(t) +5q(t),t(t) +m),t) = L(q(t),q*(t),t) + 



dL 

dq-j 



We note then using ^f(t)g{t) = 

dL 

d qj 

This yields for S[q(t) + Sq(t)} 



f(t)g(t) + f(t)g(t) 

d ( dL . 

-5q, 



dt \dq 



d dL\ 
dt dqj J 



dqj . 



+ 0(e 2 ) (U 



(1.9) 



S[q(t)\ 



rti M 

1 = 1 



dqj 



d f dL 
dt \ dqj ; j 



f 1 J ^ d f 



d f dL 

dqj 



6qj + 0(6 2 



(1.10) 



From this follows (1.6) immediately. 
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We now consider the question for which functions the functionals of the type (1.5) assume extreme 
values. For this purpose we define 

Definition: An extremal of a differentiable functional S[ ] is a function q e (t) with the property 

6S[q e (t),Sq(t)} = for all <5<f(t) G T e . (1.11) 

The extremals q e (t) can be identified through a condition which provides a suitable differential 
equation for this purpose. This condition is stated in the following theorem. 
Theorem: Euler-Lagrange Condition 

For the functional defined through (1.5), it holds in case 5q(to) = Sq(ti) = that q e (t) is an 
extremal, if and only if it satisfies the conditions (j = 1,2, ... , M) 

1(1) -w,- 

The proof of this theorem is based on the property 
Lemma: If for a continuous function f(t) 

/ : [t ,h] Ct -> 1 (1.13) 

holds 



/ 



tl 

dtf(t)h(t) = (1.14) 



'to 

for any continuous function h{t) G F t with h(to) = h{t\) = 0, then 

/(*) = on [t ,h]. (1.15) 
We will not provide a proof for this Lemma. 

The proof of the above theorem starts from (1.6) which reads in the present case 



6S[q(t),8q(t)] = / 1 dt < 

Jto 



: / r i)I. d ( dL 



E 

j'=i 



dqj dt \dq~j 



Sqj(t) } ■ (1-16) 



This property holds for any 5qj with 5q(t) G T e . According to the Lemma above follows then (1.12) 
for j = 1,2,... M. On the other side, from (1.12) for j = 1,2,... M and <%(i ) = <%(*i) = 
follows according to (1.16) the property 5S[q e (t), •] = and, hence, the above theorem. 

An Example 

As an application of the above rules of the variational calculus we like to prove the well-known result 
that a straight line in is the shortest connection (geodesies) between two points (xi,yi) and 
( x 2iU2)- Let us assume that the two points are connected by the path y(x), y(x\) = yi, y(x2) = yi- 
The length of such path can be determined starting from the fact that the incremental length ds 
in going from point (x, y(x)) to (x + dx, y{x + dx)) is 



ds s i(dx) 2 + (^_dx) 2 = dx^Jl + (^) 2 . (1.17) 
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The total path length is then given by the integral 



£V 1 + <i>'- (L18) 



s is a functional of y(x) of the type (1.5) with L(y(x), t|) = -y/l + (dy/dx) 2 . The shortest path 
is an extremal of s[y(x)] which must, according to the theorems above, obey the Euler-Lagrange 
condition. Using y' = ^| the condition reads 



d /dL\ d I y' 



dx \dy> J dx \y/l + (y') 2 



0. (1.19) 



From this follows y' j y/l + (y') 2 = const and, hence, y' = const. This in turn yields y(x) = 
ax + b. The constants a and 6 are readily identified through the conditons y(xi) = y\ and 
y{%2) = 2/2- One obtains 

y ( x ) = 1/1 - ^ 2 ( x - X2 ) + y 2 . (1.20) 
Xl - x 2 



Exercise 1.1.1: Show that the shortest path between two points on a sphere are great circles, i.e., 
circles whose centers lie at the center of the sphere. 



1.2 Lagrangian Mechanics 

The results of variational calculus derived above allow us now to formulate the Hamiltonian Prin- 
ciple of Least Action of Classical Mechanics and study its equivalence to the Newtonian equations 
of motion. 

Threorem: Hamiltonian Principle of Least Action 

The trajectories q(t) of systems of particles described through the Newtonian equations of motion 

S<»WM + g = W = V2,...M (1.21) 



are extremals of the functional, the so-called action integral, 

pti 

where L(q(t),q(t),t) is the so-called Lagrangian 

M 

2 



S[q(t)} = 1^ dtL(q(t),m,t) (1.22) 



M 1 

L(q(t),q(t),t) = ^^mjq'j - U(q 1 ,q 2 , ■ ■ ■ ,9m) ■ (1-23) 



j'=i 



Presently we consider only velocity-independent potentials. Velocity-dependent potentials which 
describe particles moving in electromagnetic fields will be considered below. 



1.2: Lagrangian 
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For a proof of the Hamiltonian Principle of Least Action we inspect the Euler-Lagrange conditions 
associated with the action integral defined through (1.22, 1.23). These conditions read in the 
present case 

~ r \ wr- ) = rim.qA = (1.24 

dqj dt \dqj J dqj dV 3H3 ' v ' 

which are obviously equivalent to the Newtonian equations of motion. 
Particle Moving in an Electromagnetic Field 

We will now consider the Newtonian equations of motion for a single particle of charge q with 
a trajectory f(t) = (x\(t), X2(t), xs(t)) moving in an electromagnetic field described through the 
electrical and magnetic field components E(r,t) and B(f,t), respectively. The equations of motion 
for such a particle are 

^(mf) = F(r,t); F(f,t) = qE(r,t) + ^vxB(f,t) (1.25) 

where ^ = v and where F(f, t) is the Lorentz force. 
The fields E(f, t) and B(r, t) obey the Maxwell equations 

Vx£ + \§- t B = (1.26) 
V-B =0 (1.27) 

VxB -±J t E =^f (1.28) 
V-E = 4tt P (1.29) 

where p(f, t) describes the charge density present in the field and J(f, t) describes the charge current 
density. Equations (1.27) and (1.28) can be satisfied implicitly if one represents the fields through 
a scalar potential V(f, t) and a vector potential A(r, t) as follows 

B = V x A (1.30) 

e --w-ig. (i.3i) 

Gauge Symmetry of the Electromagnetic Field 

It is well known that the relationship between fields and potentials (1.30, 1.31) allows one to 
transform the potentials without affecting the fields and without affecting the equations of motion 
(1.25) of a particle moving in the field. The transformation which leaves the fields invariant is 

A'(r,t) = A(r,t) + VK(f,t) (1.32) 
V'(r,t) = V(f,t) - ~K(r,t) (1.33) 
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Lagrangian of Particle Moving in Electromagnetic Field 

We want to show now that the equation of motion (1.25) follows from the Hamiltonian Principle 
of Least Action, if one assumes for a particle the Lagrangian 

L(r,'r,t) = \mv 2 - qV{f,t) + - A(f, t) ■ v . (1.34) 
— ■ c 

For this purpose we consider only one component of the equation of motion (1.25), namely, 

itrn Vl ) = F 1 = -^ i + l[^B ]l . (1.35) 

We notice using (1.30), e.g., B 3 = - 

[v x B]l = x 2 B 3 - x 3 B 2 = X2 ^_ - _j - xs ^_ - _j . (1.36) 

This expression allows us to show that (1.35) is equivalent to the Euler-Lagrange condition 

d { dL\ dL 



The second term in (1.37) is 



dt \dxi J dx\ ^' (^-37) 



8L 8V , q (dA 1 . dA 2 . dA 3 

OX\ OX\ c \ OX\ OX\ OX\ 



The first term in (1.37) is 



d_ ( dL_ \ d_ 
dt \dx\ J dt 



t \ i ^ dA.\ d q fdA 1 . , dAx . dA 1 . \ 

(mxi) H — = - mxi + - - — xi + - — x 2 + ^ — x 3 . (1.39) 

c dt dt c \ oxi ox 2 0x3 ) 



The results (1.38, 1.39) together yield 

d_ 

dt 17 dx\ ' c 



d (m±i) = + « (1.40) 



where 



dA 1 . SA 2 . dA 3 . 9Ai . 9Ai . dA 1 . 

O = — xi + - — x 2 + - — x 3 - - — xi - - — x 2 - - — x 3 
OXl OX\ OXl OX\ ox 2 ox 3 



8A 2 . 


dA 3 




dA x 


dx\ 


dxi 


x 3 - 


dx\ 


BAA 






OA; 


dx 2 J 


- x 3 ^ 


dx 3 


dx } 



. 'dA 2 UA-y \ . / UA]_ UA 3 . 

^h"-^ - *3 - -7T-) (1.41) 



which is identical to the term (1.36) in the Newtonian equation of motion. Comparing then (1.40, 
1.41) with (1.35) shows that the Newtonian equations of motion and the Euler-Lagrange conditions 
are, in fact, equivalent. 



1.3: Symmetry Properties 
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1.3 Symmetry Properties in Lagrangian Mechanics 

Symmetry properties play an eminent role in Quantum Mechanics since they reflect the properties 
of the elementary constituents of physical systems, and since these properties allow one often to 
simplify mathematical descriptions. 

We will consider in the following two symmetries, gauge symmetry and symmetries with respect to 
spatial transformations. 

The gauge symmetry, encountered above in connection with the transformations (1.32, 1.33) of elec- 
tromagnetic potentials, appear in a different, surprisingly simple fashion in Lagrangian Mechanics. 
They are the subject of the following theorem. 
Theorem: Gauge Transformation of Lagrangian 

The equation of motion (Euler-Lagrange conditions) of a classical mechanical system are unaffected 
by the following transformation of its Lagrangian 

L'{q,'q,t) = L{q,'q,t) + ~ K{q,t) (1.42) 

at c 

This transformation is termed gauge transformation. The factor | has been introduced to make this 
transformation equivalent to the gauge transformation (1.32, 1.33) of electyromagnetic potentials. 
Note that one adds the total time derivative of a function K{f, t) the Lagrangian. This term is 

d , t , dK dK dK dK dK ,., . 

_ K(M ) _ _ X1 + + _ I3 + _ _ (VJO-. + w . (1.43) 

To prove this theorem we determine the action integral corresponding to the transformed La- 
grangian 



S'[q(t)} = f U dtL\q,'q,t) = t dtL{q,i?,t) + Q - K(q, t) 

Jt JtQ c 



tl 



to 



q 



c 



tl 



= S[q(t)} + 1 K(q,t) (1.44) 



to 



Since the condition 6q(to) = Sq(t\) = holds for the variational functions of Lagrangian Me- 
chanics, Eq. (1.44) implies that the gauge transformation amounts to adding a constant term to 
the action integral, i.e., a term not affected by the variations allowed. One can conclude then 
immediately that any extremal of S'[q(t)] is also an extremal of S^i)]. 

We want to demonstrate now that the transformation (1.42) is, in fact, equivalent to the gauge 
transformation (1.32, 1.33) of electromagnetic potentials. For this purpose we consider the trans- 
formation of the single particle Lagrangian (1.34) 

L'(f,f,t) = \mv 2 - qV(f,t) + - A(f,t) ■ v + - ^-K(r,t) . (1.45) 
— ■ c c do 

Inserting (1.43) into (1.45) and reordering terms yields using (1.32, 1.33) 

L'{r,'r,t) = l -mv 2 - q (v{f,t) - \ + \ (A?,t) + VK^j -v 

= -mv 2 - qV'(r,t) + - A'(r, t) ■ v . (1.46) 
— ■ c 
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Obviously, the transformation (1.42) corresponds to replacing in the Lagrangian potentials V(f, t), A(f, t) 
by gauge transformed potentials V'(f,t), A'(f,t). We have proven, therefore, the equivalence of 
(1.42) and (1.32, 1.33). 

We consider now invariance properties connected with coordinate transformations. Such invariance 
properties are very familiar, for example, in the case of central force fields which are invariant with 
respect to rotations of coordinates around the center. 

The following description of spatial symmetry is important in two respects, for the connection 
between invariance properties and constants of motion, which has an important analogy in Quantum 
Mechanics, and for the introduction of infinitesimal transformations which will provide a crucial 
method for the study of symmetry in Quantum Mechanics. The transformations we consider are 
the most simple kind, the reason being that our interest lies in achieving familiarity with the 
principles (just mentioned above ) of symmetry properties rather than in providing a general tool 
in the context of Classical Mechanics. The transformations considered are specified in the following 
definition. 

Definition: Infinitesimal One-Parameter Coordinate Transformations 
A one-parameter coordinate transformation is decribed through 

f> = r"(r,e) , f, r G , e e R (1.47) 

where the origin of e is chosen such that 

f(f,0) = f . (1.48) 
The corresponding infinitesimal transformation is defined for small e through 

— * — * c)j^ 
/(r,e) = f + eR(r) + 0(e 2 ) ; R(r) = — 

In the following we will denote unit vectors as d, i.e., for such vectors holds a ■ a = 1. 



(1.49) 



Examples of Infinitesimal Transformations 

The beauty of infinitesimal transformations is that they can be stated in a very simple manner. In 
case of a translation transformation in the direction e nothing new is gained. However, we like to 
provide the transformation here anyway for later reference 

f' = f + ee . (1.50) 

A non-trivial example is furnished by the infinitesimal rotation around axis e 

r = f + eexr. (1-51) 

We would like to derive this transformation in a somewhat complicated, but nevertheless instructive 
way considering rotations around the X3~axis. In this case the transformation can be written in 
matrix form 

cose —sine 

sine cose | | X2 \ (1-52) 
1 











(5) 


-( 
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In case of small e this transformation can be written neglecting terms 0(e 2 ) using cose = 1 + 0(e 2 ), 
sine = e + 0(e 2 ) 



+ e x 2 + 0(e 2 ) ■ (1-53) 






One can readily verify that in case e = £3 (ej denoting the unit vector in the direction of the 
Xj axis ) (1.51) reads 

r" = r — X2 e~i + x\ e.2 (1-54) 

which is equivalent to (1.53). 

Anytime, a classical mechanical system is invariant with respect to a coordinate transformation 
a constant of motion exists, i.e., a quantity C(f,r) which is constant along the classical path of 
the system. We have used here the notation corresponding to single particle motion, however, the 
property holds for any system. 

The property has been shown to hold in a more general context, namely for fields rather than only 
for particle motion, by Noether. We consider here only the 'particle version' of the theorem. Before 
the embark on this theorem we will comment on what is meant by the statement that a classical 
mechanical system is invariant under a coordinate transformation. In the context of Lagrangian 
Mechanics this implies that such transformation leaves the Lagrangian of the system unchanged. 
Theorem: Noether 's Theorem 

If L(q,q,t) is invariant with respect to an infinitesimal transformation <f = q + eQ(q), then 
YljLi Qj§Sr 1S a cons t an t °f motion. 

We have generalized in this theorem the definition of infinitesimal coordinate transformation to 

M-dimensional vectors q. 

In order to prove Noether's theorem we note 



q'j = Qj + eQj(q) (1.55) 

# =^ + e ESr*. (i.56) 



k=l ™ 

Inserting these infinitesimal changes of qj and <jj into the Lagrangian L(q, q, t) yields after Taylor 
expansion, neglecting terms of order 0(e 2 ), 

M FIT M f)T r)f) 

L'(q,qM) = L(q,qM) + e^^ + e £ ^^q, (1.57) 

0=1 aq i j,k=i OQj OQk 

where we used ^Qj = X^fel=i(gf^Qj) ( ?fc. Invariance implies L' = L, i.e., the second and third term 
in (1.57) must cancel each other or both vanish. Using the fact, that along the classical path holds 
the Euler-Lagrange condition = ^( J^) one can rewrite the sum of the second and third term 
in (1.57) 

From this follows the statement of the theorem. 
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Application of Noether's Theorem 

We consider briefly two examples of invariances with respect to coordinate transformations for the 



We first determine the constant of motion in case of invariance with respect to translations as 
defined in (1.50). In this case we have Qj = i-j ■ e, j = 1,2, 3 and, hence, Noether's theorem yields 
the constant of motion (qj = Xj, j = 1, 2, 3) 



We obtain the well known result that in this case the momentum in the direction, for which 
translational invariance holds, is conserved. 

We will now investigate the consequence of rotational invariance as described according to the 
infinitesimal transformation (1.51). In this case we will use the same notation as in (1.59), except 
using now Qj = e.j ■ (e x r). A calculation similar to that in (1.59) yields the constant of motion 
(e x r) ■ mv. Using the cyclic property (a x b) • c = (bx c) ■ a = (c x a) -b allows one to rewrite the 
constant of motion e- (fx mv) which can be identified as the component of the angular momentum 
mf x v in the e direction. It was, of course, to be expected that this is the constant of motion. 
The important result to be remembered for later considerations of symmetry transformations in 
the context of Quantum Mechanics is that it is sufficient to know the consequences of infinitesimal 
transformations to predict the symmetry properties of Classical Mechanics. It is not necessary to 
investigate the consequences of global, i.e, not infinitesimal transformations. 





(1.59) 



Chapter 2 

Quantum Mechanical Path Integral 



2.1 The Double Slit Experiment 

Will be supplied at later date 

2.2 Axioms for Quantum Mechanical Description of Single Parti- 
cle 

Let us consider a particle which is described by a Lagrangian L(f,f,t). We provide now a set of 
formal rules which state how the probability to observe such a particle at some space-time point 
f, t is described in Quantum Mechanics. 

1. The particle is described by a wave function ip(f,t) 

tp : M 3 <g> M -»• C. (2.1) 

2. The probability that the particle is detected at space-time point f,t is 

mf,t)\ 2 =wj)mt) (2.2) 

where z is the conjugate complex of z. 

3. The probability to detect the particle with a detector of sensitivity f(r) is 

/ d\f{mm\ 2 (2.3) 

where CI is the space volume in which the particle can exist. At present one may think of 
f(r) as a sum over ^-functions which represent a multi-slit screen, placed into the space at 
some particular time and with a detector behind each slit. 

4. The wave function if;(f, t) is normalized 

/ d 3 r\i;(r,t)\ 2 = 1 Vi, t € [t ,ti] , (2.4) 

a condition which enforces that the probability of finding the particle somewhere in 0, at any 
particular time t in an interval [io,£i] in which the particle is known to exist, is unity. 
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5. The time evolution of ip(r, t) is described by a linear map of the type 

ij){f,t) = [ rf 3 r>(r,i|r,t')^r,0 t > t' , t,t' G [t , *i] (2.5) 

6. Since (2.4) holds for all times, the propagator is unitary, i.e., (t > t', t,t' £ [to,ti]) 

f n d*rmr, t)\ 2 = 

In d3r In dV In dV ' <K f > *K> <K^> *0 ^> ^(**'> #) 

= f Q d*r\mt')\ 2 = 1. (2.6) 

This must hold for any if){f ,t') which requires 

/ d 3 r' <j)(r,t\?,t')<j)(f,t\r*',t') = - f") (2.7) 
Jn 



7. The following so-called completeness relationship holds for the propagator (t > t' t,t' £ 

[*o,*i]) 

/ d 3 r</.(r,t|r',0^,* / |n),to) = 0(r,t|r o ,i o ) (2.8) 

This relationship has the following interpretation: Assume that at time to a particle is gen- 
erated by a source at one point fo in space, i.e., ip(fo, to) = S(r — fo). The state of a system 
at time t, described by ip(r, t), requires then according to (2.8) a knowledge of the state at all 
space points f* G fiat some intermediate time t'. This is different from the classical situation 
where the particle follows a discrete path and, hence, at any intermediate time the particle 
needs only be known at one space point, namely the point on the classical path at time t'. 

8. The generalization of the completeness property to N — 1 intermediate points t > tjy-i > 
tN-2 > ••• > h > t is 

<p{f,t\f ,t ) = J n <fir N -! In d3r N-2 ■■■ I n d 3 n 
(/>(rt\r N -i,t N -i) (t>(r N -i,t N -i\r N -2,t N -2) ■ ■ ■ <f>(ri,ti\r ,t ) ■ (2.9) 

Employing a continuum of intermediate times t' € [to,h] yields an expression of the form 

fff(t N )=r N 

4>(f,t\r ,t ) = d[r(t)]^[f(t)]- (2-10) 

JJr(tt))=n) 

We have introduced here a new symbol, the path integral 

rrr(t N )=f N 

// dm]- (2-H) 

Mr(t )=f 

which denotes an integral over all paths f(t) with end points f(to) = fo and r(ijv) = "Hv- 
This symbol will be defined further below. The definition will actually assume an infinite 
number of intermediate times and express the path integral through integrals of the type 
(2.9) for N -> oo. 



2.2: Axioms 
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9. The functional $[r(t)] in (2.11) is 



<t>[r(t)] = expj ^[r(t)]| (2.12) 
where 5[f{i)] is the classical action integral 

S[f(t)\ = / dtL(r,r,t) (2.13) 

and 

ft = 1.0545 • 10~ 27 erg s . (2.14) 

In (2.13) L(r, r, t) is the Lagrangian of the classical particle. However, in complete distinction 
from Classical Mechanics, expressions (2.12, 2.13) are built on action integrals for all possible 
paths, not only for the classical path. Situations which are well described classically will be 
distinguished through the property that the classical path gives the dominant, actually often 
essentially exclusive, contribution to the path integral (2.12, 2.13). However, for microscopic 
particles like the electron this is by no means the case, i.e., for the electron many paths 
contribute and the action integrals for non-classical paths need to be known. 



The constant h given in (2.14) has the same dimension as the action integral S[r(t)]. Its value 
is extremely small in comparision with typical values for action integrals of macroscopic particles. 
However, it is comparable to action integrals as they arise for microscopic particles under typical 
circumstances. To show this we consider the value of the action integral for a particle of mass 
m = 1 g moving over a distance of 1 cm/s in a time period of 1 s. The value of S^t)] is then 

S c l = ^mv 2 t = ^ergs. (2.15) 

The exponent of (2.12) is then S c i/h ~ 0.5 • 10 27 , i.e., a very large number. Since this number is 
multiplied by 'i', the exponent is a very large imaginary number. Any variations of S c i would then 
lead to strong oscillations of the contributions exp(^S') to the path integral and one can expect 
destructive interference betwen these contributions. Only for paths close to the classical path is 
such interference ruled out, namely due to the property of the classical path to be an extremal of 
the action integral. This implies that small variations of the path near the classical path alter the 
value of the action integral by very little, such that destructive interference of the contributions of 
such paths does not occur. 

The situation is very different for microscopic particles. In case of a proton with mass m = 
1.6725 • 10~ 24 g moving over a distance of 1 A in a time period of 10~ 14 s the value of is 
S c i ~ 10~ 26 erg s and, accordingly, S c i/h ~ 8. This number is much smaller than the one for the 
macroscopic particle considered above and one expects that variations of the exponent of 3>[r(t)] 
are of the order of unity for protons. One would still expect significant descructive interference 
between contributions of different paths since the value calculated is comparable to 2tt. However, 
interferences should be much less dramatic than in case of the macroscopic particle. 



14 



Quantum Mechanical Path Integral 



2.3 How to Evaluate the Path Integral 

In this section we will provide an explicit algorithm which defines the path integral (2.12, 2.13) 
and, at the same time, provides an avenue to evaluate path integrals. For the sake of simplicity we 
will consider the case of particles moving in one dimension labelled by the position coordinate x. 
The particles have associated with them a Lagrangian 

L(x,x,t) = ^mx 2 - U{x) . (2.16) 

In order to define the path integral we assume, as in (2.9), a series of times > t^-i > > 
. . . > t\ > to letting N go to infinity later. The spacings between the times tj+i and tj will all be 
identical, namely 

t j+1 - tj = (t N - t )/N = e N . (2.17) 

The discretization in time leads to a discretization of the paths x{t) which will be represented 
through the series of space-time points 

{(x , t ), (xi,h), . . . (x N -i,t N -i), (x N , t N )} . (2.18) 

The time instances are fixed, however, the xj values are not. They can be anywhere in the allowed 
volume which we will choose to be the interval ] — oo, oo[. In passing from one space-time instance 
(xj,tj) to the next (xj+i,tj+i) we assume that kinetic energy and potential energy are constant, 
namely ^ m ( x j+i ~ x j) 2 / e N an d U(xj), respectively. These assumptions lead then to the following 
Riemann form for the action integral 

S[x(t)} = Hm ejv (\m ^ +1 - X ^ - U( Xj )) . (2.19) 

The main idea is that one can replace the path integral now by a multiple integral over xi, x%, etc. 
This allows us to write the evolution operator using (2.10) and (2.12) 

4>(x N ,t N \x ,t ) = limjv^ooCTv f*™ dxi f*™ dx 2 ■ ■ ■ f*™ dx N -i 

exp { | e N Efj-o 1 [ - U( Xj ) ] } . (2.20) 

Here, Cn is a constant which depends on N (actually also on other constant in the exponent) which 
needs to be chosen to ascertain that the limit in (2.20) can be properly taken. Its value is 

— 

Cn = m 2 (2 21) 

2iriheN 



2.4 Propagator for a Free Particle 

As a first example we will evaluate the path integral for a free particle following the algorithm 
introduced above. 



2.4: Propagator for a Free Particle 
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Rather then using the integration variables Xj, it is more suitable to define new integration variables 
yj, the origin of which coincides with the classical path of the particle. To see the benifit of such 
approach we define a path y(t) as follows 

x(t) = x d (t) + y(t) (2.22) 

where x c i(t) is the classical path which connects the space-time points (xo, to) and (xjv, t^), namely, 



Xcl(t) 



XQ + 



(t - to) . 



(2.23) 



tjV — to 

It is essential for the following to note that, since x{to) = x c i(to) = xoandx(iAr) = x c i(t^[) = xjy, 
it holds 

y(to) = y(t N ) = o . (2.24) 

Also we use the fact that the velocity of the classical path x c \ = (xat — xo) / (t n — to) is constant. The 
action integral 1 S[x(t)\x(to) = xq, x(i/v) = xn] for any path x(t) can then be expressed through 
an action integral over the path y{t) relative to the classical path. One obtains 

S[x(t)\x(t ) = x , x(t N ) = x N ] = j^dt\m(x 2 d + 2x c /y + y 2 ) = 

Hytlmxl + midf^dty + dt\my* . 

The condition (2.24) implies for the second term on the r.h.s. 

dty = y(t N ) - y(t ) = . 



/ 



The first term on the r.h.s. of (2.25) is, using (2.23), 



I" 



1 



dt-m x d = -m 



1 (XN 



xo) 



tN — ^0 



(2.25) 
(2.26) 

(2.27) 
(2.28) 



The third term can be written in the notation introduced 

t N i 

dt-my 2 = S[x(t)\x(t ) = 0, x(t N ) = 0] , 
to 2 

i.e., due to (2.24), can be expressed through a path integral with endpoints x(to) = 0, x(tjv) = 0. 
The resulting expression for S[x(t)\x(to) = xo, x(tjy) = xn] is 

1 (x N - x ) 2 



^^(^Ix^o) = x , x(t N ) = X N ] 



-m- 
2 tN — to 



+ + 



(2.29) 



+ S[x(t)\x(t ) = 0, x(t N ) = 0] . 



This expression corresponds to the action integral in (2.13). Inserting the result into (2.10, 2.12) 
yields 



(j)(x N ,t N \xo,to) = exp 
a result, which can also be written 



im (xtv — xo 
2h tjv — 



,21 



jrx(t N )=0 
x(t )=0 



4>(x N ,t N \xo,to) 



exp 



im (xat — xo) z 
2h t]\f — to 



d[x(t)\ exp \ - S[x(t)\ 



<j)(0,t N \0,to) 



(2.30) 



(2.31) 



1 We have denoted explicitly that the action integral for a path connecting the space-time points (x ,t ) and 
(xn^n) is to be evaluated. 
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Evaluation of the necessary path integral 



To determine the propagator (2.31) for a free particle one needs to evaluate the following path 
integral 



(f)(0,t N \0,t o ) = lim N - 



x f-™d yi ■ ■ ■ f+™dy N -! exp | e N £ 



j=0 2 m ? 



The exponent £" can be written, noting yo = 2/tv = 0, as the quadratic form 



E = 



im 



( - yij/2 - 2/22/1 + 2y| - y 2 2/3 - 2/32/2 

VN-2VN-1 - VN-WN-2 + 22/ir_i ) 



JV-1 

j,k=l 



where are the elements of the following symmetric (N — 1) x (iV — 1) matrix 



J-jk 



m 



( 


2 


-1 


.. 





o \ 




-1 


2 


-1 .. 













-1 


2 .. 
















.. 


2 


-1 


V 











-1 


2 / 



The following integral 



dyi 



r+oc I N-l \ 

/ dy N ^exp i ^ Vja jk y k 
J -°° \ j,k=i J 



must be determined. In the appendix we prove 



f+OO 



/+oo /H t i 

dyi ■ ■ ■ / dyjv-i exp i ^ VjbjkVk 



det(bjfc) 



(2.32) 



(2.33) 



(2.34) 



(2.35) 



(2.36) 



which holds for a d-dimensional, real, symmetric matrix (bjk) and det(bjk) ^ 0. 

In order to complete the evaluation of (2.32) we split off the factor i^— in the definition (2.34) of 

{a,jk) defining a new matrix (Ajk) through 



djk 



m 



A 



jk 



Using 



det(a jk ) 



m 
2 Tie at 



N-l 



det(A jk ) , 



(2.37) 



(2.38) 



2.4: Propagator for a Free Particle 
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a property which follows from det(cB) = c n detB for any n x n matrix B, we obtain 

1 



(f)(0,t N \0,t o ) = lim N - 



m 


2 


2mheN 






m 



N-l 
2 



y/det(A jk ) 



(2.39) 



In order to determine det(Ajk) we consider the dimension n of (Aj/,), presently N — 1, variable, let 
say n, n = 1, 2, . . .. We seek then to evaluate the determinant of the n x n matrix 



( 2 -1 

-1 2 -1 

0-1 2 



\ 



\ 



2 -1 
-1 2 / 



(2.40) 



For this purpose we expand (2.40) in terms of subdeterminants along the last column. One can 
readily verify that this procedure leads to the following recursion equation for the determinants 



Dr. 



2D n -i — D n -2 . 



To solve this three term recursion relationship one needs two starting values. Using 



£>i = |(2)| = 2; D 2 



2 -1 
-1 2 



one can readily verify 



D n = n + 1 . 



(2.41) 

(2.42) 
(2.43) 



We like to note here for further use below that one might as well employ the 'artificial' starting 

values Dq = 1, D\ = 2 and obtain from (2.41) the same result for D 2 , D%, .... 

Our derivation has provided us with the value det(A,-fc) = N. Inserting this into (2.39) yields 



(j)(0,t N \0,t ) = lirriN- 



m 



2-iriheNN 

and with e^A^ = t^ — to , which follows from (2.18) we obtain 



(2.44) 



0(O,tjv|O,to) = 

Expressions for Free Particle Propagator 



m 



2mh(tN — to) 



(2.45) 



We have now collected all pieces for the final expression of the propagator (2.31) and obtain, defining 

t = ijv, x = xn 



4>(x,t\xo,t ) 



m 



2mh(t - t ). 



exp 



im (x — xo) z 
~2h t-t Q 



(2.46) 
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This propagator, according to (2.5) allows us to predict the time evolution of any state function 
ip(x, t) of a free particle. Below we will apply this to a particle at rest and a particle forming a 
so-called wave packet. 

The result (2.46) can be generalized to three dimensions in a rather obvious way. One obtains then 
for the propagator (2.10) 



<f>(r,t\r ,t ) = 



m 



2mh(t - t ) 



exp 



im (r — foY 
~2h t-t 



(2.47) 



One-Dimensional Free Particle Described by Wave Packet 

We assume a particle at time t = t Q = is described by the wave function 



Obviously, the associated probability distribution 



1 




_tt5 2 _ 


exp ^ 



x i P° 



tt5 2 



exp 



' 5 2 



is Gaussian of width 5, centered around xq = 0, and describes a single particle since 



tt5 2 



dx exp ( -^f 



1 . 



(2.48) 



(2.49) 



(2.50) 



One refers to such states as wave packets. We want to apply axiom (2.5) to (2.48) as the initial 
state using the propagator (2.46). 

We will obtain, thereby, the wave function of the particle at later times. We need to evaluate for 
this purpose the integral 



1 


4 


" m ' 


n5 2 _ 




^2i\i%t. 



+oo 



dxo exp 



im (x — xo) z 

2h 



■ Po 



t 



(2.51) 



E (x ,x) + E(x) 

For this evaluation we adopt the strategy of combining in the exponential the terms quadratic 
(~ Xq) and linear (~ xq) in the integration variable to a complete square 

2 _ b l 
a 



ax + 2bxo = a ( xq + - 



(2.52) 



and applying (2.247). 

We devide the contributions to the exponent E (x ,x) + E(x) in (2.51) as follows 



Eq^Xqj x) — 



im 
2M 



xl\\ 



\ mo £ J V m / 



fix) 



(2.53) 



2.4: Propagator for a Free Particle 
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E(x) 



im 

2ht 



x 



/(*) 



(2.54) 



One chooses then f(x) to complete, according to (2.52), the square in (2.53) 

/ \ 2 

f(x) - ' 



l + i 



ht 



(2.55) 



im 
2ht 



1 


4 


" m ' 


_7r5 2 _ 




.2mht. 



This yields 

-^0(^0; *^) 

One can write then (2.51) 
and needs to determine the integral 

/+oo 
dx e E ° {Xo ^ 
-oo 

/+oo 
dxo exp 

/+oo 
dxo exp 
-oo 



x \ l + i- 



ht 



x - 

m 



5 2 



yf 



(2.56) 



/+oo 
dx e E "^ 
-oo 



(2.57) 



im 
2ht 



x \ 1 + i 



ht 

m5 2 



x — — / 



l+i 



ht 

m8 2 



im 
2ht 



l + i- 



ht 



rn 



Po 



Xn 



l+i 



ht 

m& 2 



(2.58) 



The integrand is an analytical function everywhere in the complex plane and we can alter the inte- 
gration path, making certain, however, that the new path does not lead to additional contributions 
to the integral. 

We proceed as follows. We consider a transformation to a new integration variable p defined through 



i I 



ht \ 



m5 2 



xq 



x - ^t 



l + i 



ht 

l>l<P 



(2.59) 



An integration path in the complex XQ-plane along the direction 



ill — * 



ht 

m5 2 



(2.60) 



is then represented by real p values. The beginning and the end of such path are the points 



-oo x 4 / i ( 1 



ht 



m 



6 2 J ' 



4 



+oo x a i 1 



ht 



rn 



(2.61) 
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whereas the original path in (2.58) has the end points 

Z\ = — OO , Z2 = +00 . 

If one can show that an integration of (2.58) along the path z\ — ► z[ and along the path zi 
gives only vanishing contributions one can replace (2.58) by 



(2.62) 



4 



1 = j i 1 - i 



ht 

m5 2 



dp exp 



ml f ht 
2ht \ + (mlp 



(2.63) 



which can be readily evaluated. In fact, one can show that z[ lies at — 00 — i x 00 and z' 2 at 
+00 + i x 00. Hence, the paths between z\ — > z[ and Z2 — > ^ nave a rea l P ar t of of ±00. Since 
the exponent in (2.58) has a leading contribution in xq of — Xq/5 2 the integrand of (2.58) vanishes 
for Rexo — > ±00. We can conclude then that (2.63) holds and, accordingly, 



2mht 



Equation (2.57) reads then 
Seperating the phase factor 



h- 1 


1 

4 


1 









exp [E(x) 



1 - i 
1 + i 



ht 

mS 2 

ht 

mS 2 



yields 



ip(x,t) 



"1 - 


i m 

mS' 2 


4 


1 


4 


1 + 


I ht 

mS 2 _ 









exp [-E(x) 



We need to determine finally (2.54) using (2.55). One obtains 

E{x) = 



+ 



2^(1 + A) 1 + 



i^x 



; ht 

mi)' 2 



l Vo f 
H2m L 



and, using 



finally 



= a — 



£(x) 



a 6 

1 + & f+~6 ' 

JT7 + l — X 



fi. 2m 



(2.64) 



(2.65) 



(2.66) 



(2.67) 



(2.68) 



(2.69) 



(2.70) 



2.4: Propagator for a Free Particle 
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which inserted in (2.67) provides the complete expression of the wave function at all times t 



ip(x,t) = 





4 


1 


4 











(2.71) 



(x - ^t) 2 



x exp 

The corresponding probability distribution is 



2* 2 (1 + S) (1 



ft* .p 



H 2m 



tt5 2 (1 + 4fi 



exp 



(*-E*) 2 



<5 2 (1 + 



(2.72) 



Comparision of Moving Wave Packet with Classical Motion 



It is revealing to compare the probability distributions (2.49), (2.72) for the initial state (2.48) and 
for the final state (2.71), respectively. The center of the distribution (2.72) moves in the direction 
of the positive x-axis with velocity v Q = p /m which identifies p Q as the momentum of the particle. 
The width of the distribution (2.72) 



5\ 1 + 



h 2 t 2 



m z 6 



2A4 



(2.73) 



increases with time, coinciding at t = with the width of the initial distribution (2.49). This 
'spreading' of the wave function is a genuine quantum phenomenon. Another interesting observation 
is that the wave function (2.71) conserves the phase factor exp[i(p /h)x] of the original wave function 
(2.48) and that the respective phase factor is related with the velocity of the classical particle and 
of the center of the distribution (2.72). The conservation of this factor is particularly striking for 
the (unnormalized) initial wave function 



ip(xo,t ) = exp(i—x ) , 
Km ) 

which corresponds to (2.48) for 5 — ► oo. In this case holds 

( Po ^ 

ip(x,t) = exp I i— x — —t 

\ m n2m 



(2.74) 



(2.75) 



i.e., the spatial dependence of the initial state (2.74) remains invariant in time. However, a time- 
dependent phase factor exp[— j^(p 2 /2m) t] arises which is related to the energy e = p 2 /2m of a 
particle with momentum p Q . We had assumed above [c.f. (2.48)] t Q = 0. the case of arbitrary t is 
recovered iby replacing t — > t in (2.71, 2.72). This yields, instead of (2.75) 



ip(x, t) = exp 



■ Po 

i — x 
m 



i_pi_ 
H 2m 



(t ~ t ) 



From this we conclude that an initial wave function 

I Po 

ip(x o ,t ) = exp i— x c 
\ m 



i P 2 o 
H 2m 



to 



(2.76) 



(2.77) 



22 



Quantum Mechanical Path Integral 



becomes at t > t 

Y>0M) = expf i^x - l -^t) , (2.78) 
\ m him J 

i.e., the spatial as well as the temporal dependence of the wave function remains invariant in this 
case. One refers to the respective states as stationary states. Such states play a cardinal role in 
quantum mechanics. 



2.5 Propagator for a Quadratic Lagrangian 

We will now determine the propagator (2.10, 2.12, 2.13) 

r x(t N )=x N 



n-x(t N )=x N r ^ \ 

<Kx N ,t N \x ,to) = // d[x(t)} exp I - S[x(t)} } (2.79) 

Mx(to)=x I n J 



for a quadratic Lagrangian 



L(x,x,t) = \mix 2 - ^c{t)x 2 - e(t)x . (2.80) 



For this purpose we need to determine the action integral 



S[x(t)] = / dt'L(x,x,t) (2.81) 

Jto 



for an arbitrary path x{t) with end points x(to) = Xo and x(tjy) = xn- In order to simplify this 
task we define again a new path y(t) 

x(t) = x d {t) + y(t) (2.82) 

which describes the deviation from the classical path x c i(t) with end points x c i(to) = xq and 
%cl(tN) = xn- Obviously, the end points of y{t) are 

y(to) = ; y(t N ) = . (2.83) 

Inserting (2.80) into (2.82) one obtains 

L(x d + y,x d + y(t),t) = L(x d ,x d ,t) + L'(y,y(t),t) + 5L (2.84) 

where 

L(x d ,x d ,t) = -mx d - -c(t)x d - e(t)x d 

L'(y,y(t),t) = l -my 2 - ^c(t)y 2 

5L = mx d y(t) - c(t)x d y - e(t)y . (2.85) 

We want to show now that the contribution of 8L to the action integral (2.81) vanishes 2 . For this 
purpose we use 

XciV = -J t ^ciy) ~ x d y (2.86) 

2 The reader may want to verify that the contribution of SL to the action integral is actually equal to the differential 
5S[x c i,y(t)] which vanishes according to the Hamiltonian principle as discussed in Sect. 1. 
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and obtain 



rtN 

/ ( 

J t 



dtSL = m [±ciy]\ 



to 



dt [mx d (t) + c(t)x cl (t) + e(t)] y(t) . 



(2.87) 



According to (2.83) the first term on the r.h.s. vanishes. Applying the Euler-Lagrange conditions 
(1.24) to the Lagrangian (2.80) yields for the classical path 

mx d + c(t)x d + e(t) = (2.88) 

and, hence, also the second contribution on the r.h.s. of (2.88) vanishes. One can then express the 
propagator (2.79) 



exp 



h 



where 



(f)(x N ,t N \x ,t ) 

Jry(tN)=u 
d[y(t)\ 
y(*o)=o 



S[x cl (t)} } 4>(0,t N \0,t ) 



>y(t N )=0 
'w(to) 

Evaluation of the Necessary Path Integral 



exp 



rtN 

/ ( 

J to 



dtL'(y,y,t) 



(2.89) 
(2.90) 



We have achieved for the quadratic Lagrangian a separation in terms of a classical action integral 
and a propagator connecting the end points y(to) = and y(tjv) = which is analogue to the 
result (2.31) for the free particle propagator. For the evaluation of </>(0, ijv|0, to) we will adopt 
a strategy which is similar to that used for the evaluation of (2.32). The discretization scheme 
adopted above yields in the present case 



4>{0,t N \0,t ) = UrriN- 



x 



J-^ d yi ■ ■ ■ S-< ™dVN-i exp 



i c / l 

n eN ^j=o ^ 2 



2niheN 



(2.91) 



m- 



4 



where Cj = c(tj), tj = to + j- One can express the exponent E in (2.91) through the quadratic 
form 

N-l 

E = i ^2 yjdjkVk 

j,k=l 

where cijk are the elements of the following (N — 1) x (N — 1) matrix 

/ 



(2.92) 



m 



2/iejy 



ejv 

2h 



V 

c 2 
c 3 



2 


-1 








o\ 


1 


2 


-1 











-1 


2 

















2 


-1 











-1 


2 / 













\ 





V o o 


















CN-2 

Civ_i / 



(2.93) 
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In case det(ajfc) / one can express the multiple integral in (2.91) according to (2.36) as follows 
0(O,ijv|O,to) = lirriN^K. 



lirriN- 



m 


N 
2 




2-KiheN 




det(a) 


m 




1 



27ri/i ( 2he N 

t_/V 



JV-1 



det(a) 



(2.94) 



In order to determine <^>(0, ijv|0, to) we need to evaluate the function 

JV-1 



/(*o,*jv) = Um N - 



e N det(a) 

V m J 



According to (2.93) holds 



/ 2 - ^ 



V 



ci 

-1 2 







D N -i 
-1 



def 



m 



-1 



JV-l 



det(a) 



(2.95) 



(2.96) 



-12- ^c 3 



















o\ 






'-CN-2 -1 

"I 2 - f ov-i / 



In the following we will asume that the dimension n = N — 1 of the matrix in (2.97) is variable. 
One can derive then for D n the recursion relationship 



D r 



C JV 

m 



D 



n-l 



D 



71-2 



(2.97) 



using the well-known method of expanding a determinant in terms of the determinants of lower 
dimensional submatrices. Using the starting values [c.f. the comment below Eq. (2.43)] 



D n 



1 



^JV 

m 



(2.98) 



this recursion relationship can be employed to determine Djv-i- One can express (2.97) through 
the 2nd order difference equation 



D n +i — 2D n + D n -\ 



Cn+l -D,, 



-JV 



m 



(2.99) 



Since we are interested in the solution of this equation in the limit of vanishing ejy we may interpret 
(2.99) as a 2nd order differential equation in the continuous variable t = nejy + to 



d 2 f(tp,t) 
dt 2 



c(t) 



m 



/(to,*) • 



(2.100) 
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The boundary conditions at t = to, according to (2.98), are 



f{t ,t ) 



df(t ,t) 



dt 



t=t 



£7V 



= 0; 



We have then finally for the propagator (2.79) 



m 



1 . 



(2.101) 



(f)(x,t\x ,t ) 



m 



2mhf(t ,t)_ 



2 I % 
exp{ -S[x d (t)} 



h 



(2.102) 



where f(to,t) is the solution of (2.100, 2.101) and where S[x c i(t)] is determined by solving first the 
Euler-Lagrange equations for the Lagrangian (2.80) to obtain the classical path x c i{t) with end 
points x c i(to) = xo and x c i(t]y) = xn and then evaluating (2.81) for this path. Note that the 
required solution x c i(t) involves a solution of the Euler-Lagrange equations for boundary conditions 
which are different from those conventionally encountered in Classical Mechanics where usually a 
solution for initial conditions x c i(to) = xq and x c i(to) = vq are determined. 



2.6 Wave Packet Moving in Homogeneous Force Field 



We want to consider now the motion of a quantum mechanical particle, decribed at time t = t Q 
by a wave packet (2.48), in the presence of a homogeneous force due to a potential V{x) = — fx. 
As we have learnt from the study of the time-development of (2.48) in case of free particles the 
wave packet (2.48) corresponds to a classical particle with momentum p Q and position x Q = 0. 
We expect then that the classical particle assumes the following position and momentum at times 
t > t Q 



y(t) 

p(t) 



Po 



(t - t 

m 

Po + / (t - t 



2 m 



to) 



The Lagrangian for the present case is 

L(x, x, t) 



-mx 2 + fx 
2 J 



(2.103) 
(2.104) 



(2.105) 



This corresponds to the Lagrangian in (2.80) for c(t) = 0, e(t) = —f. Accordingly, we can employ 
the expression (2.89, 2.90) for the propagator where, in the present case, holds L'(y,y,t) = \my 2 
such that <ji>(0, ij\r|0, to) is the free particle propagator (2.45). One can write then the propagator 
for a particle moving subject to a homogeneous force 



m 



2mh(t - t )_ 



exp 



n 



S[xci(t)} 



4>(x,t\x ,t ) ■ 

Here S[x c i(t)] is the action integral over the classical path with end points 



(2.106) 



(2.107) 
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The classical path obeys 

The solution of (2.107, 2.108) is 

X d (T) = x 



mx cl = f . 
1 / 



t — t„ 2 m 
as can be readily verified. The velocity along this path is 

x - x If 



(t - t 



l/ 2 
r + - — r 
2 m 



x c l{r) 



t — t 2 m 



(t - t ) + ^ r 



m 



and the Lagrangian along the path, considered as a function of r, is 



9(t) = ^m± 2 cl (r) + fx d (r) 

1 f X - X 1 / . , 

= -"1 — (t — t ) 

2 \t - t 2m y ' 
1 / 2 2 , , j, ( x — X Q 



f 



1 / 



t — t Q 2 m 



(t -t ) T 



2 m \ t — t 2 m 



1 / „ ^ \ _ , 1 / 2 _2 



(* - *o) r + - - r- 
2 m 



1 



-m I " 

2 V t - t 2 m 



(t - t ))T 



+ — r z 
m 



fx 



One obtains for the action integral along the classical path 

S[x c i(t)} = [ dr g{r) 
Jt 



1 ( x — x c 
-m 

2 V t - to 



2 m 



(t - to) (t - to) 



+ f 



1 / 



t — t Q 2 m 



(t - Q (t - t )' 



1 f 2 

+ - J —(t- t f + Xo f(t - t ) 

3 m 

l m <^ + !(, + *)/(.-« -!£(«-«.)■ 



and, finally, for the propagator 
4>(x,t\x ,t ) -- 



x exp 



m 

2irih(t — t Q 

im (x — x ) 2 i , x „ , 

-r ~ — + — (x + X ) f (t - t ) 

2h t - to 2h y ' J v ' 



24 hm 



(t ~ t f 



(2.108) 



(2.109) 



(2.110) 



(2.111) 



(2.112) 



(2.113) 
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The propagator (2.113) allows one to determine the time-evolution of the initial state (2.48) using 
(2.5). Since the propagator depends only on the time-difference t — 1 we can assume, withoult loss 
of generality, t Q = and are lead to the integral 



ijj(x,t) 
exp 



1 


4 


" m ' 


_ir5 2 _ 




.2iriht. 



+oo 



im (x — xo) z 
2h 



t 



25 2 



CO 

Po 

h 



dxr 



■>■,, ■ .' ljt (•'• • ■''„)// — — r' 



(2.114) 



f2 

24 mh 



E (x ,x) + E(x) 

To evaluate the integral we adopt the same computational strategy as used for (2.51) and divide 
the exponent in (2.114) as follows [c.f. (2.54)] 

f^ 



Eq{x ^ x) — 



vm 
E(x) 



1 + i 

im 
2ht 



ht 

m5 2 



— 2x I x — 



^° t J " ) + fix] 
m 2m 



2 ft' 

x 2 + - — X 
m 



24 km 



One chooses then f(x) to complete, according to (2.52), the square in (2.115) 

2 

n . n- \ 

X 



2m 



This yields 



Eo(x 0l x) 



im 
2M 



x. 



l+i 



ht 



X m 1 2rn 



m- 



l+i 



ht 

mS 2 



Following in the footsteps of the calculation on page 18 ff. one can state again 



ip(x,t) 



and is lead to the exponential (2.116) 



"l _ j « " 

m8 2 


4 


1 


4 






[^ 2 (1+S). 





1 f 2 t 3 

E{x) '. 



im 



where 



S(x) = 



. ht 

m5 2 



24 hm ' 2ht{l + i^ z ) 
ft 2 



exp [ E(x) ' 



S(x) 



m 



ht 

m5 2 



= x 



+ 



Po 



2\ 2 



t 



ft 2 \ 



m 
ft 2 

x + 2x 

m 



2m ) 



ht 



m 



x 



m 2m 



Po 

m 









yx 




m 


Po 


t - 


. fJL 


m 


2m 


f_l 


)" 





2m 





2\2 

2m 



ft 



2\ 2 



ht 

m5 2 



(2.115) 
(2.116) 

(2.117) 



(2.118) 



(2.119) 



(2.120) 



(2.121) 
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Inserting this into (2.120) yields 

( 



E(x) 



x - ^t 

m 



£t_X 

2m J 



(2.122) 



+ ^{Po + ft)x 



2mh 



Pot + Poft 2 + 



Pi 6 ft 3 



+ 



12 



The last term can be written 



Pot + Poft 2 + ^ 



2mh \ ' 3 / 2mh 

Altogether, (2.119, 2.122, 2.123) provide the state of the particle at time t > 



x exp 



mo* 


4 


1 


4 






[^ 2 (1+S). 





_ 2° + _ /*!V 



ht 

m5 2 



x exp 



h {Po + ft)X -h i dT 2m 



(2.124) 



The corresponding probablity distribution is 

\i>(x,t)\ z = 1 k2t2 



exp 



2m J 



« 2 (i + 



(2.125) 



Comparision of Moving Wave Packet with Classical Motion 

It is again [c.f. (2.4)] revealing to compare the probability distributions for the initial state (2.48) 
and for the states at time t, i.e., (2.125). Both distributions are Gaussians. Distribution (2.125) 
moves along the x-axis with distribution centers positioned at y(t) given by (2.103), i.e., as expected 
for a classical particle. The states (2.124), in analogy to the states (2.71) for free particles, exhibit a 
phase factor exp[ip(t)x/h], for which p(t) agrees with the classical momentum (2.104). While these 
properties show a close correspondence between classical and quantum mechanical behaviour, the 
distribution shows also a pure quantum effect, in that it increases its width . This increase, for 
the homogeneous force case, is identical to the increase (2.73) determined for a free particle. Such 
increase of the width of a distribution is not a necessity in quantum mechanics. In fact, in case of so- 
called bound states, i.e., states in which the classical and quantum mechanical motion is confined to 
a finite spatial volume, states can exist which do not alter their spatial distribution in time. Such 
states are called stationary states. In case of a harmonic potential there exists furthermore the 
possibility that the center of a wave packet follows the classical behaviour and the width remains 
constant in time. Such states are referred to as coherent states, or Glauber states, and will be 
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studied below. It should be pointed out that in case of vanishing, linear and quadratic potentials 
quantum mechanical wave packets exhibit a particularly simple evolution; in case of other type of 
potential functions and, in particular, in case of higher-dimensional motion, the quantum behaviour 
can show features which are much more distinctive from classical behaviour, e.g., tunneling and 
interference effects. 



Propagator of a Harmonic Oscillator 

In order to illustrate the evaluation of (2.102) we consider the case of a harmonic oscillator. In 
this case holds for the coefficents in the Lagrangian (2.80) c(t) = muo 2 and e(t) = 0, i.e., the 
Lagrangian is 

1 q 1 O O / \ 

L(x,x) = -mx — -mui x . (2.126) 



.We determine first /(to,t)- In the present case holds 

/ = -oj 2 f; /(t , to) = 0; f(t ,t ) = 1. 
The solution which obeys the stated boundary conditions is 

sinu;(t — to) 



/(to,*) 



We determine now S[x c i(t)]. For this purpose we seek first the path x c i{t) which obeys x c /(to 
and x c i{t) = x and satisfies the Euler-Lagrange equation for the harmonic oscillator 

mxd + mu) 2 x c \ = . 

This equation can be written 

X d = -0J 2 Xd . 

the general solution of which is 

%ci(t') = ^4sinw(r — to) + B cosu;(r — to) • 
The boundary conditions £ c /(to) = xo and x c i(t) = x are satisfied for 

_ . X — Xq cosuj(t — t Q ) 

B = x ; A = : — , 

sinw(i — to) 

and the desired path is 

%ci(t) = — sino;(r — to) + xocoscj(r — to) 

where we introduced 

c = cosLij(t — t ) , s = sincj(t — t Q ) 
We want to determine now the action integral associated with the path (2.133, 2.134) 

r-t 



S[xci(t)\ = £ dr Qmx^(r) - X -mu?x 2 cl {T) 



2.127) 

2.128) 

= x 

2.129) 
2.130) 

2.131) 
2.132) 

2.133) 
2.134) 

2.135) 
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For this purpose we assume presently t D = 0. From (2.133) follows for the velocity along the 
classical path 

X c i(t) 



X — XqC 
LO COSUJT — UJXq S1HUT 

s 



and for the kinetic energy 



1 



mx 2 cl {j) 



1 2 (x - X c) 2 2 
-muo k COS LOT 



o X — XqC 

—muo x Q cosujt smuT 



+ -muo 2 x 2 sin 2 o;r 



Similarly, one obtains from (2.133) for the potential energy 

(x — Xq cY 



1 2 2 / \ 1 2 

-mw x c1 {t) = -moo 



sin LOT 



2 X — Xo C 

-\-muo x cosuot smuoT 



+ -171UJ 2 X 2 COS 2 LOT 



Using 

1 

2 
1 

2 
1 

2 

the Lagrangian, considered as a function of r, reads 
9(t) = \mx 2 cl {r) - ^mu; 2 x 2 d (T) = 



COS 2 00T 



sin 2 u;r 



coslot smuoT 



cos2o;r 



cos2o;r 



2 

sin2u;r 



1 o (x — xqc) 
-muo ^ 



cos2cjt 



2 X - X C 

-muo x sm2o;r 



(2.136) 



(2.137) 



(2.138) 

(2.139) 
(2.140) 
(2.141) 



-moo 2 x 2 cos2lot 



(2.142) 



Evaluation of the action integral (2.135), i.e., of 5[x c /(r)] = /^dr^r) requires the integrals 



/' 

J o 



cLtcos2lot 



r-t 



2lo 
1 



sm2uot 



s c 



LO 



1 



/ dTsm2ooT = — [1 — cos2a;il = — s~ 
Jo 2lo 1 lo 

„2 i „2 



(2.143) 
(2.144) 



where we employed the definition (2.134) Hence, (2.135) is, using s + c = 1, 

1 (x - X c) 2 X - XQC 2 1 2 2 

>j[x c i(t)\ = -moo t, sc — mL0x o s — -muo x sc 



2 

muo 

~2s 
muo 

~2~7 



[ (x 2 — 2xx Q c + x 2 c 2 ) c — 2x Q xs 2 + 2x 2 cs 2 — x 2 s 2 c] 

[{x 2 + x 2 )c - 2x Q x] (2.145) 
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and, with the definitions (2.134), 



S[x c i(t)} 



2sinu;(t — to) 

For the propagator of the harmonic oscillator holds then 

4>(x,t\xo,t ) 



[(xq + x 2 ) cosuj(t — to) — 2xqx] 



(2.146) 



2Trihs'mu)(t — to) 

ex p{ H x + ^)^(t - t ) - 2x x] } 



(2.147) 



Quantum Pendulum or Coherent States 



As a demonstration of the application of the propagator (2.147) we use it to describe the time 
development of the wave function for a particle in an initial state 



muj 
irh 



exp 



mu)(xo — b o y 
2h 



+ ^VoXo 



(2.148) 



The initial state is decribed by a Gaussian wave packet centered around the position x = b Q and 
corresponds to a particle with initial momentum p . The latter property follows from the role of 
such factor for the initial state (2.48) when applied to the case of a free particle [c.f. (2.71)] or to 
the case of a particle moving in a homogeneous force [c.f. (2.124, 2.125)] and will be borne out of 
the following analysis; at present one may regard it as an assumption. 

If one identifies the center of the wave packet with a classical particle, the following holds for the 
time development of the position (displacement), momentum, and energy of the particle 



b(t) 
P (t) 



Po 

b Q coscj(t — t Q ) H — sinu;(t — t G ) displacement 

— mu)b sma>(t — t Q ) + p Q cos uj(t — t Q ) momentum 

energy 



(2.149) 



2m 



+ \muj 2 b 2 n 



We want to explore, using (2.5), how the probability distribution \ip(x, t)\ 2 of the quantum particle 

propagates in time. 

The wave function at times t > to is 



/oo 
dx (f)(x,t\x ,t )if;(x ,to) 
-oo 

Expressing the exponent in (2.148) 



2Hs'muj(t — t ) 
(2.147, 2.150, 2.151) can be written 

Y>(x,t) 



i (x — b ) 2 sina;(t — t ) + — - x sincj(t — t Q ) 

muj 



(2.150) 



(2.151) 
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27ricohsmuj(t — to) 



dxo exp [Eq + E ] 



(2.152) 
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where 



£^0(^0,2;) = 
E(x) = 



imuo 
2hs 



2 Po 



muj 



x D s + f(x) 



zmuj 
2hs 



[x 2 c + isbl - f(x)] . 



(2.153) 
(2.154) 

(2.155) 



c = cos u(t — t Q ) , s = sinu;(t — t D ) . 

Here f(x) is a function which is introduced to complete the square in (2.153) for simplification of 
the Gaussian integral in xq. Since E(x) is independent of x Q (2.152) becomes 



ip(x,t) 



~muj~ 


1 
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. ttH - 





m 



2Triu)hsmu)(t — to) 
We want to determine now E (x o ,x) as given in (2.153). It holds 



e E{x) / dx exj>[E (x o ,x)} 



E n 



imuj 
2hs 



c 2 e Mt-t ) _ 2Xq(x + isbo _P^ S ^ + f, x j 



For f(x) to complete the square we choose 

f{x) = (X + isb - S) 2 e -<"(*-*») . 

One obtains for (2.157) 



E (x o ,x) = 



imco 



Po 

. exp \iu(t — to)] xq — (x + isb s) exp (— iojit — to) 

2ns V mu 



To determine the integral in (2.156) we employ the integration formula (2.247) and obtain 



+00 



dxn e E °( Xo) 



2irihs'muj(t — to) 
muj exp[iuj(t — to)] 



Inserting this into (2.156) yields 



ip(x,t) 



muj 



3 e E{x) 



(2.156) 

(2.157) 
(2.158) 
(2.159) 

(2.160) 
(2.161) 



For E(x) as defined in (2.154) one obtains, using exp[±iu(t — t )] = c ± is, 

E(x) = ^ [x 2 c + isb 2 Q - x 2 c + isx 2 - 2isxb c - 2s 2 xb Q 
+ s 2 b 2 Q c - is 3 b 2 Q + 2^xsc + 2i^ b s 2 c 

- 2i ^ xs 2 + 2 b s 3 - -4^s 2 c + i^ s 3 

[x 2 + c 2 b 2 — 2xb c + 2ixsb — ib 2 sc 

2 2°, xs + 2 b Q sc + * 2 - ^ 



2?i 



e 

raw - ■ m 2 w 2 



2i -Er- xc 



-2i ^ h Q s 2 + i 



sc 



- - lrnu;b 2 )sc + -^pAs 2 



(2.162) 
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We note the following identities 

2m 



/ 

Jt, 



1 . 1 / pi mujb1\ 1, 9 , 

to(t -*o)+o U^ - "V M c - o W (2.163) 



/ 



to 



2 v 2 \2rnu; 2 / 2 

mo; 2 6 2 (T 



= 9 " *o) - 9 ^ ^ + 6 p oS 2 (2.164) 

2 2 \2tou; 2 / 2 

where we employed 6(r) and p(r) as defined in (2.149). From this follows, using p{r) = mb(r) and 
the Lagrangian (2.126), 

J*drL[b(T),b(T)] = " ^) ^ - 6 oPoS (2.165) 

such that E(x) in (2.162) can be written, using again (2.149)), 

E(x) = _^[x-6(*)] 2 + - *^(*-* ) - \ fdrL[b(T),b(r)] (2-166) 

Inserting this into (2.161) yields, 

= E] 1 x ^ - ^ [* - &(*)] 2 } x ( 2 - 167 ) 

xexp|^p(t)x - iiw(t-t ) - ^ jf drL[6(T),6(T)]| 

where 6(t), p(i), and e Q are the classical displacement, momentum and energy, respectively, defined 
in (2.149). 



Comparision of Moving Wave Packet with Classical Motion 

The probability distribution associated with (2.167) 

exp{-^[*-^)] 2 } (2.168) 

is a Gaussian of time-independent width, the center of which moves as described by b(t) given in 
(2.148) , i.e., the center follows the motion of a classical oscillator (pendulum) with initial position 
b and initial momentum p Q . It is of interest to recall that propagating wave packets in the case of 
vanishing [c.f. (2.72)] or linear [c.f. (2.125)] potentials exhibit an increase of their width in time; in 
case of the quantum oscillator for the particular width chosen for the initial state (2.148) the width, 
actually, is conserved. One can explain this behaviour as arising from constructive interference due 
to the restoring forces of the harmonic oscillator. We will show in Chapter 4 [c.f. (4.166, 4.178) and 
Fig. 4.1] that an initial state of arbitrary width propagates as a Gaussian with oscillating width. 



irh 
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In case of the free particle wave packet (2.48, 2.71) the factor exp(ip x) gives rise to the transla- 
tional motion of the wave packet described by p t/m, i.e., p also corresponds to initial classical 
momentum. In case of a homogeneous force field the phase factor exp(ip Q x) for the initial state 
(2.48) gives rise to a motion of the center of the propagating wave packet [c.f. (2.125)] described by 
(p /m)t + ^ft 2 such that again p Q corresponds to the classical momentum. Similarly, one observes 
for all three cases (free particle, linear and quadratic potential) a phase factor exp[ip(t)x/h] for the 
propagating wave packet where ip(i) corresponds to the initial classical momentum at time t. One 
can, hence, summarize that for the three cases studied (free particle, linear and quadratic potential) 
propagating wave packets show remarkably close analogies to classical motion. 
We like to consider finally the propagation of an initial state as in (2.148), but with b Q = and 
p = 0. Such state is given by the wave function 



muj 



\ ( mujx 2 , iuj 



where we added a phase factor exp(— iut /2). According to (2.167) the state (2.169) reproduces 
itself at later times t and the probablity distribution remains at all times equal to 



muj 



i.e., the state (2.169) is a stationary state of the system. The question arises if the quantum 
oscillator posesses further stationary states. In fact, there exist an infinite number of such states 
which will be determined now. 

2.7 Stationary States of the Harmonic Oscillator 

In order to find the stationary states of the quantum oscillator we consider the function 



W(x,t) = exp^^se-™ - e" 2 ^ - ^ x 2 - . (2.171) 

We want to demonstrate that w(x,t) is invariant in time, i.e., for the propagator (2.147) of the 
harmonic oscillator holds 

/+oo 
dx <f>(x, t\x Q , t Q ) W(x , t ) . (2.172) 
-oo 

We will demonstrate further below that (2.172) provides us in a nutshell with all the stationary 
states of the harmonic oscillator, i.e., with all the states with time-independent probability distri- 
bution. 

In order to prove (2.172) we express the propagator, using (2.147) and the notation T = t — t Q 



(f)(x,t\xo,to) = e~^ iu>T 



moo 



7rh(l - e~ 2i " T ) 



x exp 



mco , 9 9 . 1 + e mu 2xx e 

(x + x 



(2.173) 



2h ' ~ ' 1 - e- 2 ^ T hco 1 - e- 2 ^ T 
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One can write then the r.h.s. of (2.172) 



where 



/ = e-3*"* 



_vrft(l - e- 2 ^ T ) 



/+oo 
dx exp[E (x , 
-co 



■r) + E(: 



»] 



muj 
~2h 



2 / 1 + e -2io,r 



Following the 



1 \ 

. e -2iwT J 

n ( 2xe-™ T n nr 

\ 1 — e V mw 

rv ft mo; [ 2 1 + e" 2 ^ T 2ft _ 2iwi 
-Bfx) = z 2 H e 2laJ *° - /(x 

? by now familiar strategy one choses f(x) to complete the 

/(*) = \^~ *- 2wT ) [p^Jr + 2i/A e -^ 
This choice of f(x) results in 



(2.174) 
(2.175) 

- jyx, (2.176) 
ete the square in (2.175), namely, 

— e -i"to\ _ (2.177) 
mijj / 



+ /(x) 



rtiio 

~ ~2h 



Xq 



1 _ e -2iwT 



+ 



1 1 - e~ 2i " T ( 2xe~^ T /~h 
1 - e- 2 ^ T + 2 Vto 



n 2 



e -iu)t 



WILO 

1 ^( e -2^T 



— (X + Z D ) 2 



for some constant z Q G C. Using (2.247) one obtains 

/+oo 
dXoe Eo(x ,x) = 
-oo 

and, therefore, one obtains for (2.174) 
For E(x), as given in (2.176, 2. 



raw 



/ = e - : 



.177), holds 



(2.178) 

(2.179) 
(2.180) 



1 + e -2^T 
X 1 — 



__ + 2ft e _ 2ia)to _ 2x 2 e- 2 ^ 

e -2iwT m(jJ 1 — r— ^ 



e -2iwT 

-[JJL xe -w _ 2 (1 - fT 2 - T ) A e -2-*o 
mw raw 
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Altogether, one obtains for the r.h.s. of (2.172) 



/ = exp^^xe-*"* - e~ 2 ^ - ^ x 1 - ^iut ) . (2.182) 
Comparision with (2.171) concludes the proof of (2.172). 

We want to inspect the consequences of the invariance property (2.171, 2.172). We note that the 
factor exp(2y/muj/h xe~ tujt — e ~ 2lult ) in (2.171) can be expanded in terms of e ~ mujt : n = 1,2,.... 
Accoordingly, one can expand (2.171) 

OO j 

W(x,t) = exp[-iw(n+ i)t] 4> n (x) (2.183) 

n=0 n ' 

where the expansion coefficients are functions of x, but not of t. Noting that the propagator (2.147) 
in (2.172) is a function of t — t Q and defining accordingly 

$(x,x ;t-t ) = <f>(x,t\x ,t ) (2.184) 

we express (2.172) in the form 

OO j 

^ — exp[-iw(n + |) t] 4> n {x) 

n=0 n - 

= / dx $(x,x ;t - t ) — exp[-io;(m+ ±)i ] 4> m (x ) (2.185) 
Replacing i — > t + t yields 



n=0 



OO j 

/ c/x <I>(x,x ;i) — - exp[-zw(m + |) t c ] ^ m (x ) (2.186) 

m=0 J ~°° m - 

Fourier transform, i.e., dt exp[iw(n + \)t ] ■ ■ ■ , results in 

exp[-iuj(n + ±) t] (j) n (x) 

/+oo 
dx $(x, x G ; t - t ) — <Mx ) (2.187) 
-oo n - 



-■OO 

1 

n! 



or 



exp[-iu;(n + §) i] 0„(x) 

/+oo 
dx </>(x,i|x ,i ) exp[-iw(n+ §)t ] 0„(x o ) . (2.188) 
-OO 

Equation (2.188) identifies the functions ip n (x, t) = exp[— iuj(n+ §) t] 4> n (x) as invariants under the 
action of the propagator </>(x, t\x , t Q ). In contrast to W(x,t), which also exhibits such invariance, 
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the functions ij) n {x,t) are associated with a time-independent probablity density \ip n (x,t)\ 2 = 
|^ n (x)| 2 . Actually, we have identified then, through the expansion coefficients 4> n (x) m (2.183), 
stationary wave functions ip n (x, t) of the quantum mechanical harmonic oscillator 

i> n (x, t) = exp[-iuj(n + \) t] N n 4> n (x) , n = 0, 1, 2, . . . (2.189) 

Here N n are constants which normalize ip n (x,t) such that 

/+oo f+OO 
dx\ij{x,t)\ 2 = N 2 dx(f>l(x) = 1 (2.190) 
-oo J — oo 

is obeyed. In the following we will characterize the functions (f) n (x) and determine the normalization 
constants N n . We will also argue that the functions ip n (x,t) provide all stationary states of the 
quantum mechanical harmonic oscillator. 



The Hermite Polynomials 

The function (2.171), through expansion (2.183), characterizes the wave functions (j> n (x). To obtain 
closed expressions for (f) n (x) we simplify the expansion (2.183). For this purpose we introduce first 
the new variables 



^x (2.191) 
z = e~ iujt (2.192) 

and write (2.171) 

W(x,t) = zh- y2/2 w(y,z) (2.193) 

where 

w(y,z) = exp(2yz - z 2 ) . (2.194) 

Expansion (2.183) reads then 

oo n 

w(y,z)zh- y2 / 2 = z^ ^ — My) (2.195) 



n! 

n=0 



or 

oo 

Z 



where 



w[.,, 

n=0 



H n (y) = e y/ 2 4> n (y). (2.197) 

The expansion coefficients H n {y) in (2.197) are called Hermite polynomials which are polynomials 
of degree n which will be evaluated below. Expression (2.194) plays a central role for the Hermite 
polynomials since it contains, according to (2.194), in a 'nutshell' all information on the Hermite 
polynomials. This follows from 

gn 



= H n (y) (2.198) 

z=0 
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which is a direct consequence of (2.196). One calls w(y, z) the generating function for the Hermite 
polynomials. As will become evident in the present case generating functions provide an extremely 
elegant access to the special functions of Mathematical Physics 3 . We will employ (2.194, 2.196) to 
derive, among other properties, closed expressions for H n (y), normalization factors for (f)(y), and 
recursion equations for the efficient evaluation of H n {y). 

The identity (2.198) for the Hermite polynomials can be expressed in a more convenient form 
employing definition (2.196) 



QU 

dz 1 



w(y,z) 



2=0 



gn 

dz r 



2yz-z' 



2=0 



gn 

~dz~" 



2 d n 
v ; dy n 



(-ire 



u 



gr, 



2=0 

,2 



-y 



(2.199) 



2=0 dy n 

Comparision with (2.196) results in the so-called Rodrigues formula for the Hermite polynomials 



(2.200) 



One can deduce from this expression the polynomial character of H n (y), i.e., that H n (y) is a 
polynomial of degree n. (2.200) yields for the first Hermite polynomials 



H Q (y) = 1, Hx{y) = 2y, H 2 (y) = 4y 2 - 2, H 3 (y) = 8y 3 - 12y, . 



(2.201) 




Figure 2.1: Schematic representation of change of summation variables v and juton = v + u, and 
m = v — jjL. The diagrams illustrate that a summation over all points in a v, ii lattice (left diagram) 
corresponds to a summation over only every other point in an n, m lattice (right diagram). The 
diagrams also identify the areas over which the summation is to be carried out. 

We want to derive now explicit expressions for the Hermite polynomials. For this purpose we expand 
the generating function (2.194) in a Taylor series in terms of y p z q and identify the corresponding 
coefficient c m with the coefficient of the p-th power of y in H q (y). We start from 

z 2 »(-\y{2y) v -»z v -» 

(-iy(2y) v -»z v+ » (2.202) 



3 generating functionology by H.S.Wilf (Academic Press, Inc., Boston, 1990) is a useful introduction to this tool as 
is a chapter in the eminently useful Concrete Mathematics by R.L.Graham, D.E.Knuth, and O.Patashnik (Addison- 
Wesley, Reading, Massachusetts, 1989). 



00 v 1 / \ 

-ELM 

jy=0ft=0 \ ^ / 

00 V 1 / \ 
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and introduce now new summation variables 

n = v + fi , m = v — [x 0<n<oo,0<m<n. (2.203) 

The old summation variables [i expressend in terms of n, m are 

n + m n — m , 

v = — ^— , li = — ^— . (2.204) 

Since v, fx are integers the summation over n, m must be restricted such that either both n and m 
are even or both n and m are odd. The lattices representing the summation terms are shown in 
Fig. 2.1. With this restriction in mind one can express (2.202) 



e 



2yz — z 



oo^ ^ n |n ^^-q^ 



_ 

n n! ^ ! ™ ! 

n=0 m>0 \ J 



"M = £ rfe^ ^r 2k ■ (2-207) 



Since (n — m)/2 is an integer we can introduce now the summation variable k = (n — m)/2 , < 
k < [n/2] where [x] denotes the largest integer p, p < x. One can write then using m = n — 2k 

e 2y 2 -^ = y * y M (2y)"- 2fc . (2.206) 

^ n! ^ k\(n-2k)\ y y > y ' 

n=0 k=0 V 7 

V „ ' 

= »»&/) 

From this expansion we can identify H n (y) 

[n/2] 

k\ (n-2k)\ 

This expression yields for the first four Hermite polynomials 

H (y) = 1, H^y) = 2y, H 2 (y) = Ay 2 - 2, H 3 (y) = 8y 3 - 12y, . . . (2.208) 
which agrees with the expressions in (2.201). 

From (2.207) one can deduce that H n (y), in fact, is a polynomial of degree n. In case of even n , the 
sum in (2.207) contains only even powers, otherwise, i.e., for odd n, it contains only odd powers. 
Hence, it holds 

Hn(-y) = (-l) n H n (y) . (2.209) 

This property follows also from the generating function. According to (2.194) holds w(—y,z) = 
w(y, —z) and, hence, according to 2.197) 

£ h Hn{ - y) = £ ^r F - (y) = £ h (2-2io) 

n=0 n=0 n=0 

from which one can conclude the property (2.209). 
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The generating function allows one to determine the values of H n (y) at y = 0. For this purpose one 
considers w(0, z) = exp(— z 2 ) and carries out the Taylor expansion on both sides of this expression 
resulting in 

ra=0 n=0 

Comparing terms on both sides of the equation yields 

#2n(0) = (-l) n{2n ^, H 2n+1 (0) = 0, n = 0,l,2,... (2.212) 
n! 

This implies that stationary states of the harmonic oscillator (f>2 n +i(x), as defined through (2.188, 
2.197) above and given by (2.233) below, have a node at y = 0, a property which is consistent 
with (2.209) since odd functions have a node at the origin. 

Recursion Relationships 

A useful set of properties for special functions are the so-called recursion relationships. For Hermite 
polynomials holds, for example, 

H n+1 (y) - 2yH n (y) + 2nH n ^(y) = 0, n = 1,2,... (2.213) 

which allow one to evaluate H n {y) from Ho(y) and H\{y) given by (2.208). Another relationship is 

^H n (y) = 2ntf n _!(y), n=l,2,... (2.214) 

We want to derive (??) using the generating function. Starting point of the derivation is the 
property of w(y, z) 

|^(y,z) - {2y - 2z)w(y,z) = (2.215) 

which can be readily verified using (2.194). Substituting expansion (2.196) into the differential 
equation (2.215) yields 

00 n— 1 00 n 00 n+l 

E ^ Hn{y) ~ 2y ^h. Hn{y) + 2 £ V = • (2 - 216) 

n=l ^ n=0 ' n=0 

Combining the sums and collecting terms with identical powers of z 

00 n 

E ~T - 2j/fl- n (y) + 2nfl- n _i(y)l + ^(y) - 2yH (y) = (2.217) 



n! 

71=1 



gives 

#i(y) - 2yi^ (y) = 0, H n+1 (y) -2yH n {y) + 2nH n ^{y) = 0, n=l,2,... (2.218) 

The reader should recognize the connection between the pattern of the differential equation (??) 
and the pattern of the recursion equation (??): a differential operator d/dz increases the order n 
of H n by one, a factor z reduces the order of H n by one and introduces also a factor n. One can 
then readily state which differential equation of w (y, z) should be equivalent to the relationship 
(??), namely, dw/dy — 2zw = 0. The reader may verify that w(y,z), as given in (2.194), indeed 
satisfies the latter relationship. 
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Integral Representation of Hermite Polynomials 

An integral representation of the Hermite polynomials can be derived starting from the integral 

/+oo 

dte 2iyt ~ t2 . (2.219) 
-oo 

which can be written 

/+oo /*+oo 

dte -(t-iy) 2 = e -v 2 / dze -z 2 . (2.220) 
-oo J —oo 

Using (2.247) for a = i one obtains 

I(y) = ^e~ y2 (2.221) 

and, acording to the definition (2.226a), 

-I r+oo 

e~ y2 = —= \ dte 2iyt ~ t2 . (2.222) 



Employing this expression now on the r.h.s. of the Rodrigues formula (2.200) yields 



/ i\n r+oo in 

H n (y) = K -^- e y2 / dt e ^yt-t- (2 223) 



The identity 

^_ e 2iyt-t* = (2it) n e 2iyt ~ t2 (2.224) 
results, finally, in the integral representation of the Hermite polynomials 

on (_j\n y 2 r+oc 

H n (y) = 1 > 6 / dt^e 2 *"*"* , n = 0,1,2,... (2.225) 

V 71 " J-oo 

Ort honor mality Properties 

We want to derive from the generating function (2.194, 2.196) the orthogonality properties of the 
Hermite polynomials. For this purpose we consider the integral 

/+oo 
dye~^ y ~ z ' z '^ = ^e 2zz ' 
-oo 

°° on ~n Jn 

= ^ E • (2 - 226) 

n=0 

Expressing the l.h.s. through a double series over Hermite polynomials using (2.194, 2.196) yields 

°° r+oo n In' °° 7 n In 

E / dyH n (y)HAy)e- y ^ = £ Tnl ^ — (2.227) 

ra,n'=0 •' x n II 

Comparing the terms of the expansions allows one to conclude the orthonormality conditions 

r+oo 



\ dyH n (y)H n ,(y)e~y =2 n n\^b nn ,. (2.228) 

J — oo 
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Figure 2.2: Stationary states (f) n (y) of the harmonic oscillator for n = 0, 1,2,3,4. 

Normalized Stationary States 

The orthonormality conditions (2.228) allow us to construct normalized stationary states of the 
harmonic oscillator. According to (2.197) holds 



My) = e-y 2 / 2 H n { y ). 

The normalized states are [c.f. (2.189, 2.190)] 

Mv) = N n e- y2 / 2 H n (y) . 
and for the normalization constants N n follows from (2.228) 

/ + 0O 
dye^ H 2 n (y) = N 2 2 n n\^ = 1 
-oo 

We conclude 

1 

N n = 



and can finally state the explicit form of the normalized stationary states 

1 



Mv) = 



e~y 2 / 2 H n (y). 



(2.229) 
(2.230) 

(2.231) 
(2.232) 

(2.233) 



The stationary states (2.233) are presented for n = 0,1,2,3,4 in Fig. 2.2. One can recognize, in 
agreement with our above discussions, that the wave functions are even for n = 0,2,4 and odd 
for n = 1,3. One can also recognize that n is equal to the number of nodes of the wave function. 
Furthermore, the value of the wave function at y = is positive for n = 0,4, negative for n = 2 
and vanishes for n = 1, 3, in harmony with (??). 
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The normalization condition (2.231) of the wave functions differs from that postulated in (2.189) 
by the Jacobian dx/dy, i.e., by 



/ dx 




"mar 


1 dy 




l h \ 



(2.234) 



The explicit form of the stationary states of the harmonic oscillator in terms of the position variable 
x is then, using (2.233) and (2.189) 



<t>n{x) 



1 



V2 n n\ 



muj 



e 2 ft H n { 



x). 



(2.235) 



Completeness of the Hermite Polynomials 

The Hermite polynomials are the first members of a large class of special functions which one 
encounters in the course of describing stationary quantum states for various potentials and in spaces 
of different dimensions. The Hermite polynomials are so-called orthonogal polynomials since they 
obey the conditions (2.228). The various orthonogal polynomials differ in the spaces Q C R over 
which they are defined and differ in a weight function w(y) which enter in their orthonogality 
conditions. The latter are written for polynomials p n { x ) m the general form 

dx p n (x) p m (x) w(x) = I n 5 nm (2.236) 

where w(x) is a so-called weight function with the property 

w(x) > 0, w(x) = only at a discrete set of points Xk G ft (2.237) 

and where /„ denotes some constants. Comparision with (2.228) shows that the orthonogality 
condition of the Hermite polynomials is in complience with (2.236 , 2.237) for fi = R, w(x) = 
exp(— x 2 ), and /„ = 2 n n!y / 7r. 

Other examples of orthogonal polynomials are the Legendre and Jacobi polynomials which arise 
in solving three-dimensional stationary Schrodinger equations, the ultra- spherical harmonics which 
arise in n-dimensional Schrodinger equations and the associated Laguerre polynomials which arise 
for the stationary quantum states of particles moving in a Coulomb potential. In case of the 
Legendre polynomials, denoted by Pi(x) and introduced in Sect. 5 below [c.f. (5.150 , 5.151, 5.156, 
5.179] holds ft = [—1,1], w(x) = 1, and Ig = 2/(2^ + 1). In case of the associated Laguerre 
polynomials, denoted by L^\x) and encountered in case of the stationary states of the non- 
relativistic [see Sect. ??? and eq. ???] and relativistic [see Sect. 10.10 and eq. (10.459] hydrogen 
atom, holds 17 = [0, +oo[, w(x) = x a e~ x , I n = T{n + a + l)/n\ where T(z) is the so-called Gamma 
function. 

The orthogonal polynomials p n mentioned above have the important property that they form a 
complete basis in the space T of normalizable functions, i.e., of functions which obey 

/ dx f{x) w(x) = < oo , (2.238) 
Jn 

where the space is endowed with the scalar product 

(f\9) = [ dx f(x)g(x) w(x) = < oo , f,gef. (2.239) 
Jn 



./c 
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As a result holds for any / G T 

f(x) = CnPn(x) (2.240) 

n 

where 

c n = ±- f dx w{x) f{x) Pn {x) . (2.241) 
I n Jn 

The latter identity follows from (2.236). If one replaces for all / £ f : f(x) — ► y/w(x) f(x) and, 
in particular, p n (x) — > -y/w;(x)p n (:z;) the scalar product (2.239) becomes the conventional scalar 
product of quantum mechanics 

</|<7> = / dxf(x)g(x) . (2.242) 
Jn 

Let us assume now the case of a function space governed by the norm (2.242) and the existence of a 
normalizable state ip(y, t) which is stationary under the action of the harmonic oscillator propagator 
(2.147), i.e., a state for which (2.172) holds. Since the Hermite polynomials form a complete basis 
for such states we can expand 

oo 

i/>(y,t) = J2 c n(t)e- y2/2 H n (y) . (2.243) 

?1=0 

To be consistent with(2.188, 2.197) it must hold c n {t) = d n exp[— iuj{n + ^)t] and, hence, the 
stationary state ip(y, t) is 

OO 

1>(y, t) = J2 d n exp[-iw(n + -)t] I 2 H n (y) . (2.244) 

n=0 

For the state to be stationary \if;(x,t)\ 2 , i.e., 

oo 

J2 d* n d m exp[iu,(m - n)t] e~ y2 H n (y)H m (y) , (2.245) 

n,m=0 

must be time-independent. The only possibility for this to be true is d n = 0, except for a single 
n = n Q , i.e., ip(y,i) must be identical to one of the stationary states (2.233). Therefore, the states 
(2.233) exhaust all stationary states of the harmonic oscillator. 



Appendix: Exponential Integral 

We want to prove 



+oo +oo 



/ = Jd Vl ... jdy n e'^WW = J^^y (2-246) 

— oo — oo 

for det(a) / and real, symmetric a, i.e. a T = a. In case of n = 1 this reads 

/* +0 ° ■ 2 Utt 

/ dxe tax = J— , (2.247) 
7-oo V a 
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which holds for a S C as long as a / 0. 

The proof of (2.246) exploits that for any real, symmetric matrix exists a similarity transformation 
such that 

/an ... \ 



S^aS = a 



a 2 2 • 



\ . 







O'nn ) 



(2.248) 



where S can be chosen as an orthonormal transformation, i.e., 

S T S = 11 or S = S" 1 



(2.249) 



The afcfc are the eigenvalues of a and are real. This property allows one to simplify the bilinear 
form J2]k yj a jkUk by introducing new integration variables 



Vj = X^ 5 ^okVk ; yk = ^Skjjjj . 



(2.250) 



The bilinear form in (2.246) reads then in terms of jjj 

n n 

YTj^Vj a 3kVk =^ ^ ytSjiajkSkmVm 
j,k im 
n n 

= ^ ^ ye(S T )ija jk S km y ri 

j,k im 
n 

= yfiikVk 
j,k 

where, according to (2.248, 2.249) 



(2.251) 



For the determinant of a holds 



as well as 



®3 k — ' S y^ J {S T ) jl a lmS m k ■ 
l,m 



dei(a) = Y\ Ujj 

3=1 



det(a) = de*(S -1 aS) = det(S -1 ) det(a) , det(S) 
= (det(S))" 1 det(a) det(S) = det(a) . 



One can conclude 



dei(a) = Yl> 



"33 • 



(2.252) 



(2.253) 



(2.254) 
(2.255) 



3=1 
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We have assumed det(a) / 0. Accordingly, holds 

n 

Hajj^O (2.256) 

such that none of the eigenvalues of a vanishes, i.e., 

ajj / , for j = 1, 2, n (2.257) 
Substitution of the integration variables (2.250) allows one to express (2.250) 



+oo 



I = Jdyi ... Jdy n 

— oo — oo 

where we introduced the Jacobian matrix 



det 



<Hyu ■ ■ ■ , Vn) 
d(yi, ...,yn) 



Y.la kk y k _ (2.258) 



d{yi, ...,y n ) 



with elements 

According to (2.250) holds 
and, hence, 



+oo +oo 



k ~ 1 -tx 



which leaves us to determine integrals of the type 

+oo 



(2.259) 



d(yi, ...,y n ) 

Jjs = |^ • (2.260) 



J = S (2.261) 



det( d J; yu --- ,yn l ) = det(S) . (2.262) 



d(m,...,y n ) 

1 = det(S T S) = (detS) 2 (2.263) 



detS = ±1 (2.264) 



From (2.249) follows 
such that one can conclude 
One can right then (2.258) 

I = Jdy! ... Jdy n e^'^l 

— oo — oo 

+oo +oo n +oo 

= Jdme idl1 ^ ... Jdy n e ra ™ti = f[ Jdy k e™' (2.265) 



Jdxe icx2 (2.266) 
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where, according to (2.257) holds c ^ 0. 

We consider first the case c > and discuss the case c < further below. One can relate inte- 
gral (2.266) to the well-known Gaussian integral 

+oo 

jdx e~ cx2 = y^,c> 0. (2.267) 

— oo 

by considering the contour integral 

J = Idz e icz2 = (2.268) 



along the path 7 = 71 + 72 + 73 + 74 displayed in Figure 2.3. The contour integral (2.268) vanishes, 

2 

since e tcz is a holomorphic function, i.e., the integrand does not exhibit any singularities anywhere 
in C. The contour intergral (2.268) can be written as the sum of the following path integrals 



J — J\ + J2 + J3 + J4 ; Jk — 

7fe 



jdze icz2 (2.269) 



The contributions can be expressed through integrals along a real coordinate axis by realizing 
that the paths 7^ can be parametrized by real coordinates x 



71 : z = x J\ = J dx e tcx2 

-p 
v 

72 : z = ix + p J2 = J i dx e %c ( %x +v) 



-y/2p 



-ex 2 



73 : z = \A x J3 = j y/i dx e lc (v^a;) 2 

V2 P (2.270) 

= —y/i f dx e~ 

-\/2p 
. _ 

74 : z = ix — p J4 = J i dx e tc( - tx p > , 

-p 

for x,p G IR. 



Substituting — x for x into integral J4 one obtains 





J A = J H) dx e ic (-»+P) 2 

cix e ic (^-p) 2 = J 2 . (2.271) 



} * 
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Im(z) 



P Re(z) 



Figure 2.3: Contour path 7 in the complex plain. 

We will now show that the two integrals J2 and J4 vanish for p — > +00. This follows from the 
following calculation 



lim I J2 or 4 1 = hm 



v 

I 





< lim / |i| dx \ e ic (P 2 - x ^\ \ e ~ 2cx P\ 





(2.272) 



It holds \e i <P 2 - x2 )\ = 1 since the exponent of e is purely imaginary. Hence, 



v 

lim |J2or4| < hm / dx \e~ 2cxp \ 



lim 



1 - e" 2c P 



. 



2 c p 

J2 and J4 do not contribute then to integral (2.268) for p = +00. One can state accordingly 



(2.273) 



J 



= Jdxe icx2 - \fi Jt 
—00 

Using 2.267) one has shown then 



dx e 



= . 



(2.274) 



dx e l 



(2.275) 
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One can derive the same result for c < 0, if one chooses the same contour integral as (2.268), but 
with a path 7 that is reflected at the real axis. This leads to 

00 —00 

J = jdxe icx2 + jdxe cx2 = (2.276) 

—00 00 

and (c < 0) 



00 



dxe icx2 = J = \l — (2.277) 



We apply the above results (2.275, 2.277) to (2.265). It holds 



1 = n \ - = \ ,r ri " • ^ 



Noting (2.255) this result can be expressed in terms of the matrix a 



' (iir) n 
det(a) 



which concludes our proof. 



(2.279) 
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Chapter 3 

The Schrodinger Equation 



3.1 Derivation of the Schrodinger Equation 

We will consider now the propagation of a wave function ip(r,t) by an infinitesimal time step e. It 
holds then according to (2.5) 



ip(r,t + e) = / d 3 r (f)(f,t + e\f ,t) ip(r ,t) 
Jn 



(3.1) 



We will expand the l.h.s. and the r.h.s. of this equation in terms of powers of e and we will 
demonstrate that the terms of order e require that ip(f, t) satisfies a partial differential equation, 
namely the Schrodinger equation. For many situations, but by no means all, the Schrodinger 
equation provides the simpler avenue towards describing quantum systems than the path ingral 
formulation of Section 2. Notable exceptions are non-stationary systems involving time-dependent 
linear and quadratic Lagrangians. 

The propagator in (3.1) can be expressed through the discretization scheme (2.20, 2.21). In the 
limit of very small e it is sufficient to employ a single discretization interval in (2.20) to evaluate 
the propagator. Generalizing (2.20) to one obtains then for small e 



4>(r,t + e\f ,t) 
From this follows 



" m " 


1 j i 


.2mhe. 


exp U 



m (r — r*o) 2 



ip(r, t + e) 



d 3 r 



m 



2irihe 



exp 



m(r — r*o) 2 



- eU(r,t) 



e U(f, t) 



(3.2) 



Wo,t). (3.3) 



In order to carry out the integration we set ro = r + s and use s as the new integration variable. 
We will denote the components of s by (xi,X2,X3) T . This yields 



mt + e) = J^d Xl J^dx 2 J^dx 3 [ 



, 3 
m 1 2 
2iriHel 



x exp 



i m x\ + x\ + x\ 
h ~2 ~e 



eU(r,t) ^{r + s,t) 



(3.4) 



even m xi,X2,, X3 



51 



52 



Schrddinger Equation 



It is important to note that the integration is not over r, but over s = (x\,X2,xs) T , e.g. U(f, t) is a 
constant with respect to this integration. The integration involves only the wave function i/j(f-\-s, t) 
and the kinetic energy term. Since the latter contributes to (3.4) only for small x\ + x\ + x\ values 
we expand 

3 d 1 3 d 2 

ip(r + s,t) = ip(f,t) + ^2xj—ij(f,t) + - ^2 XjX k — — ip(f,t) + ... (3.5) 



j,k=i 



' dxjdxk 



assuming that only the leading terms contribute, a supposition which will be examined below. Since 
the kinetic energy contribution in (3.4) is even in all three coordinates x±, X2, x%, only terms of the 
expansion (3.5) which are even separately in all three coordinates yield non-vanishing contributions. 
It is then sufficient to consider the terms 



J J K 



of the expansion of if)(r + s, t). 

Obviously, we need then to evaluate integrals of the type 



/+oo 
dxx 2n exp (iax 2 ) , n = 0,1,2 
-oo 



According to (2.36) holds 
Inspection of (3.7) shows 



Io(a) 



I n +i(a) 



1 d_ 

i da 



I n (a). 



Starting from (3.8) one can evaluate recursively all integrals I n (a). It holds 



h (a) 



; h{a) 



3 

4^2 



2a V a 

It is now important to note that in case of integral (3.4) one identifies 

- = — = OM 

a m 

and, consequently, the terms collected in (3.6) make contributions of the order 

0(e§) , O(ei) , 0{el) , 0{el) . 



(3.6) 



(3.7) 

(3.8) 
(3.9) 

(3.10) 

(3.11) 
(3.12) 



Here one needs to note that we are actually dealing with a three-fold integral. According to (3.11) 
holds 



m 



l 7T 



-2irihe- 



= 1 



(3.13) 
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and one can conclude, using (3.10), 
ip(f, t + e) = exp 



-- h u { f,t) 



i>{f,t) + J^*V 2 i/>(r, t) + 0(f 2 ) 



This expansion in terms of powers of e suggests that we also expand 



V>(f,i + e) = iP(r,t) + e^(r,t) + 0(e 2 ) 



and 



exp 



-^(f, ( ) 



- i - ^U(f,t) + 0{e 2 ). 



(3.14) 

(3.15) 
(3.16) 



Inserting this into (3.14) results in 



(3.17) 



Obviously, this equation is trivially satisfied to order O(e ). In order 0(e) the equation reads 



d 

ih— ip(f, t) 



2m 



V 2 + U(f,t) 



(3.18) 



This is the celebrated time- dependent Schrodinger equation. This equation is often written in the 
form 

ih^-^(r,t) = H^(f,t) (3.19) 



where 



dt 



(3.20) 



3.2 Boundary Conditions 

The time-dependent Schrodinger equation is a partial differential equation, 1st order in time, 2nd 
order in the spatial variables and linear in the solution i/j(f,t). The following general remarks can 
be made about the solution. 

Due to its linear character any linear combination of solutions of the time-dependent Schrodinger 
equation is also a solution. 

The 1st order time derivative requires that for any solution a single temporal condition needs to be 
specified, e.g., ip(r,ti) = f(r). Usually, one specifies the so-called initial condition, i.e., a solution 
is thought for t > to and the solution is specified at the intial time to. 

The 2nd order spatial derivatives require that one specifies also properties of the solution on a 
closed boundary dfl surrounding the volume 0, in which a solution is to be determined. We will 
derive briefly the type of boundary conditions encountered. As we will discuss in Chapter 5 below 
the solutions of the Schrodinger equation are restricted to particular Hilbert spaces H which are 
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linear vector spaces of functions f(r) in which a scalar product between two elements f,g G H is 
defined as follows 

(f\9)n = [ d 3 rf{r)g{r) (3.21) 
Jn 

This leads one to consider the integral 

(f\H\g) n = ! d 3 rf*(f)Hg(f) (3.22) 
Jn 

where H is defined in (3.20). Interchanging f*(r) and g(r) results in 

(9\H\f)n = I d 3 rg(r)Hf*(r). (3.23) 
Jn 

Since H is a differential operator the expressions (3.22) and (3.23), in principle, differ from each 
other. The difference between the integrals is 



(g\H\f) n =-(g\H\f) n 

= I/*™ ("£ v2 ) ^ ~ l/ r ^ (-£ ) rw 

+ / d 3 rf*(r)U(r,t)g(r) - [ d 3 rg(r) U(f,t), f*(r) 
Jn Jn 

= / d3r f*^ ( V2 ) 5(0 " / d 3 rg(r) (V 2 ) f*(r) (3.24) 

Using Green's theorem 1 

In d 3 r ( /* (r) V 2 5 (r) - g(r) V 2 /* (r) ) 

= fasidS ■ (f*(mg(r) - g(mf*(r)) (3.25) 

one obtains the identity 

(f\H\g)n = (g\H\f)n + / da ■ P(f*,g\f) (3.26) 

J on 

where da ■ A{r) denotes an integral over the surface d£l of the volume Q, the surface elements 
da pointing out of the surface in a direction normal to the surface and the vector-valued function 
A{f) is taken at points f G dQ. In (3.26) the vector-valued function P(f*, g\r) is called the 
concomitant of H and is 

P(/*,<7|r) = (r(F)V 5 (r) - ^Vf^) (3.27) 

We will postulate below that H is an operator in H which represents energy. Since energy is a real 
quantity one needs to require that the eigenvalues of the operator H are real and, hence, that H is 
hermitian 2 . The hermitian property, however, implies 



(f\H\g) n = (g\H\f) n (3.28) 



1 See, for example, Calssical Electrodynamics, 2nd Ed., J. D. Jackson, (John Wiley, New York, 1975), Chapter 1. 
2 The reader is advised to consult a reference text on 'Linear Algebra' to follow this argument. 
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and, therefore, we can only allow functions which make the differential da ■ P(f*,g\r) vanish on 
dn. It must hold then for all / € HI 

f(f) = W,fedn (3.29) 

or 

da ■ V/(r) =0 W,f £ dn (3.30) 

Note that these boundary conditions are linear in /, i.e., if / and g satisfy these conditions than 
also does any linear combination af + [3g. Often the closed surface of a volume dfl is the union 
of disconnected surfaces 3 dQj, i.e., <9S7 = dfl\ n dQ.2 H dfls n . . . In this case one can postulate 
both conditions (3.29, 3.30) each condition holding on an entire surface dilj. However, to avoid 
discontinuities in ip(r, t) on a single connected surface d£lj only either one of the conditions (3.29, 
3.30) can be postulated. 

A most common boundary condition is encountered for the volume Q = R 3 in which case one 
postulates 

lim f(r) = "natural boundary condition" . (3.31) 
\r\— >oo 

In fact, in this case also all derivatives of f(f) vanish at infinity. The latter property stems from 
the fact that the boundary condition (3.31) usually arises when a particle existing in a bound state 
is described. In this case one can expect that the particle density is localized in the area where the 
energy of the particle exceeds the potential eenrgy, and that the density decays rapidly when one 
moves away from that area. Since the total probability of finding the particle anywhere in space is 

f d 3 r\f(r)\ 2 = 1 (3.32) 

the wave function must decay for |f| — > oo rapidly enough to be square integrable, i.e., obey (3.32), 
e.g., like exp(— nr), k > or like r~ a , a > 2. In either case does f(r) and all of its derivatives 
vanish asymptotically. 

3.3 Particle Flux and Schrodinger Equation 

The solution of the Schrodinger equation is the wave function ijj(r, t) which describes the state of 
a particle moving in the potential U(r,t). The observable directly linked to the wave function is 
the probability to find the particle at position f at time t, namely, \ip(f, t)\ 2 . The probability to 
observe the particle anywhere in the subvolume uj C O, is 

p(u,t) = [ d 3 r\i;(f,t)\ 2 . (3.33) 

The time derivative of p(u>, t) is 

dtp(u,t) = f d 3 r [ij*{r,t)d t ^{r,t) + ^(r, t)d t ^*(r, t) } . (3.34) 

J u> 

3 An example is a volume between two concentric spheres, in which case dQi is the inner sphere and dS~l2 is the 
outer sphere. 
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Using (3.19) and its conjugate complex 4 

-iH^- t r(r,t) = H4,*(r,t) (3.35) 

yields 

ihdtp(w,t) = [ d 3 r \i>*(r,t)Hip(r,t) - ip(r, t) Htp*(r, t) 1 . (3.36) 



According to (3.26, 3.27) this can be written 



ifrd t p{u,t) = / da- P(ip*(r,t),ip(r,t)r,t) . (3.37) 

Jdco 

If one applies this identity to oj = 17 one obtains according to (3.29, 3.30) dtp(Q, t) = 0. Accord- 
ingly the probability to observe the particle anywhere in the total volume 0, is constant. A natural 
choice for this constant is 1. One can multiply the solution of (3.18) by any complex number and 
accordingly one can define ip(f, t) such that 

/ d 3 r\^(r,t)\ 2 = 1 (3.38) 
Jn 

holds. One refers to such solution as normalized. We will assume in the remainder of this Section 
that the solutions discussed are normalized. Note that for a normalized wave function the quantity 

p(r,t) = \mt)\ 2 (3-39) 
is a probability density with units 1 /volume. 

The surface integral (3.37) can be expressed through a volume integral according to 

da-A(r) = f d 3 rV-A(r) (3.40) 



One can rewrite then (3.37) 

d 3 r(d tP (r,t) + V-J(f,t)) = (3.41) 



/ 

J OJ 



where 

j(r,t) = P(r,W,t) • (3-42) 

Using (3.27) one can express this 

Since (3.41) holds for any volume oj C SI one can conclude 

d t p{r, t) + V • j(f, t) = . (3.44) 
4 Note that the Hamiltonian H involves only real terms. 
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The interpretation of j(r,i) is that of density flux. This follows directly from an inspection of 
Eq. (3.41) written in the form 

d t / d 3 rp(f,t) = - da-j(f,t). (3.45) 

Jul J du> 

Obviously, j(f,t) gives rise to a descrease of the total probability in volume u due to the disap- 
pearence of probability density at the surface dto. Note that j(r,t) points in the direction to the 
outside of volume u. 

It is of interest to note from (3.43) that any real wave function ip(f, t) has vanishing flux anywhere. 
One often encounters wave functions of the type 

0(f) = f{r)e il ^ , for /(f) el. (3.46) 

The corresponding flux is 

j(r) = ^ f(r) , (3.47) 

i.e., arises solely from the complex factor exp(z k ■ f). Such case arose in Sect. 2 for a free parti- 
cle [c.f. (2.48, 2.71)], and for particles moving in a linear [c.f. (2.105, 2.125)] and in a quadratic 
[c.f. (2.148, 2.167)] potential. In Sect. 2 we had demonstrated that a factor exp(ip x /h) induces 
a motion of 1-dimensional wave packets such that p /m corresponds to the initial velocity. This 
finding is consistent with the present evaluation of the particle flux: a factor exp(ip x /h) gives rise 
to a flux Po/m, i.e., equal to the velocity of the particle. The generalization to three dimenisons 
implies then that the factor exp(i k • f) corresponds to an intial velocit hk/m and a flux of the same 
magnitude. 



3.4 Solution of the Free Particle Schrodinger Equation 

We want to consider now solutions of the Schrodinger equation (3.18) in = M.^ in the case 
U(f,t) = 

which describes the motion of free particles. One can readily show by insertion into (3.48) that the 
general solution is of the form 

^(f,t) = [27r]~§ I d 3 k 4>(k) exp (i(k ■ r - wt)) (3.49) 
where the dispersion relationship holds 

hk 2 

uo = — . (3.50) 
Obviously, the initial condition at ip(r,to) determines 4>(k). Equation (3.49) reads at t = to 

4>(r,t ) = [2vr]-5 J d 3 k4>(k)exp(i(k-r-Lut fj . (3.51) 
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The inverse Fourier transform yields 

c[>(k) = [2ir]~% / d 3 r exp(-ik ■ f ) ip(f ,t ) ■ 



(3.52) 



We have not specified the spatial boundary condition in case of (3.49). The solution as stated is 
defined in the infinte space = M^. If one chooses the initial state f(r) defined in (3.51) to 
be square integrable it follows according to the properties of the Fourier-transform that ij>(r, t) as 
given by (3.49) is square integrable at all subsequent times and, hence, that the "natural boundary 
condition" (3.31) applies. The ensuing solutions are called wave packets. 



Comparision with Path Integral Formulation 

One can write solution (3.49, 3.51, 3.52) above 



= / d 3 r (f>(f,t\f ,t ) ip(f ,t ) 



where 



(/>(r,t\r ,t ) 



l 3 



2vr 



/ d 3 k exp f 



d 3 k exp I ik • (r — ro) 



i h 2 k 2 



h 2m 



(t - t ) 



(3.53) 



(3.54) 



This expression obviously has the same form as postulated in the path integral formulation of Quan- 
tum Mechanics introduced above, i.e., in (2.5). We have identified then with (3.54) an alternative 
representation of the propagator (2.47). In fact, evaluating the integral in (3.54) yields (2.47). To 
show this one needs to note 

1 f + °° „ ( -r , \ i h 2 kf , 

2 ~y dkl exp \ zk n x ~ x o) ~ t ttzt v* - *o) 



m 



2mh{t - t ) 
This follows from completion of the square 

i h 2 k\ 



exp 



h 2m 

im (x — xo) 2 
2h t-t 



(3.55) 



ik\{x — xq) 



h 2m 

h(t - 1 

2m 



(t - t ) 



m x — x n 



1 2 



h t — t n 



+ 



i m (x — x ) 



K 2 t-t n 



(3.56) 



and using (2.247). 

Below we will generalize the propagator (3.54) to the case of non- vanishing potentials U(r), i.e., 
derive an expression similar to (3.54) valid for this case. The general form for this propagator 
involves an expansion in terms of a complete set of eigenfunctions as in (3.114) and (4.70) derived 
below for a particle in a box and the harmonic oscillator, respectively. In case of the harmonoc 
oscillator the expansion can be stated in a closed form given in (4.81) 
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Free Particle at Rest 



We want to apply solution (3.49, 3.52) to the case that the initial state of a 1-dimensional free 
particle ip(xo,t) is given by (??). The 1-dimensional version of (3.53, 3.54) is 



where 



</>(x,t\x ,to) 
Integration over xo leads to the integral 

/. 



/+oo 
dx 4>(x,t\x ,t ) ip(x ,t) 
-oo 



- J ^ dkexpliHx-xo) - - — {t-to) 



dxoexp —ikxo + — p xo 
n 



25 2 



exp 



(k 



f ) 2 5 2 



(3.57) 
(3.58) 

(3.59) 



which is solved through completion of the square in the exponent [c.f. (3.55, (3.56)]. The remaining 
integration over k leads to the integral 



J^dkexp^kx - \{k-^) 2 5 2 - • *£(*-*,) 

=???? 



Combining (3.57-3.60) yields with to = 

ip(x,t) = 





4 


1 


4 











(3.60) 



(3.61) 



x exp 



(x 



Po+\2 



25 2 {l + 



ty n . nt .p 

(1 - « — ^) + i^rx 



h 2 t 2 



rn 



5 2 ' 



h 



I Po t 
h 2m 



a result which is identical to the expression (??) obtained by means of the path integral propagator 
(2.46). We have demonstrated then in this case that the Schrodinger formulation of Quantum 
Mechanics is equivalent to the Feynman path integral formulation. 

Stationary States 

We consider now solutions of the time-dependent Schrodinger equation (3.19, 3.20) which are of 
the form 

i;(f,t) = f(t)Hr) ■ (3.62) 

We will restrict the space of allowed solutions to a volume such that the functions also make the 
concomitant (3.27) vanish on the surface <90 of 0, i.e., the functions obey boundary conditions of 
the type (3.29, 3.30). Accordingly, the boundary conditions are 



or 



0(r) = W,fedn 
da ■ Vcf>(r) =0 Vr,redn 



(3.63) 
(3.64) 
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and affect only the spatial wave function <f>(r). As pointed out above, a common case is = fioo 
and the 'natural boundary condition' (3.31). We will demonstrate that solutions of the type (3.62) 
do exist and we will characterize the two factors of the solution f(t) and 4>{r). We may note in 
passing that solutions of the type (3.62) which consist of two factors, one factor depending only on 
the time variable and the other only on the space variables are called separable in space and time. 
It is important to realize that the separable solutions (3.62) are special solutions of the time- 
dependent Schrodinger equation, by no means all solutions are of this type. In fact, the solutions 
(3.62) have the particular property that the associated probability distributions are independent 
of time. We want to demonstrate this property now. It follows from the observation that for the 
solution space considered (3.27) holds and, hence, according to (3.42) the flux j(r,t) vanishes on 
the surface of <9fl It follows then from (3.45) that the total probability 

/ d 3 rp(f,t) = f d\\^r,t)\ 2 = \f(t)\ 2 [ d 3 r\<p(r)\ 2 (3.65) 
Jn Jn Jn 

is constant. This can hold only if \f(t)\ is time-independent, i.e., if 

f(t) = e ia , a G R . (3.66) 
One can conclude that the probability density for the state (3.62) is 

|^(r,t)| 2 = |0(r)| 2 , (3.67) 
i.e., is time- independent. One calls such states stationary states. 

In order to further characterize the solution (3.62) we insert it into (3.19). This yields an expression 

Si(t)M?) = 92{t)h 2 {r) (3.68) 

where gi(t) = ihdtf(t), g-2{t) = f(t), h\(r) = (j)(f), and /i2(r) = H (f>(r). The identity (3.68) can 
hold only for all t and all r if g±(t) = E 52 (i) and Eh\(r) = h2(r) for some E € C. We must 
postulate therefore 

d t f(t) = Ef(t) 

H<f)(r) = E(j)(r) . (3.69) 

If these two equations can be solved simultaneously a solution of the type (3.62) exists. 
It turns out that a solution for f(t) can be found for any E, namely 

f[t) = f(0)exp(- l -E?J . (3.70) 

The task of finding solutions 4>{r) which solve (3.69) is called an eigenvalue problem. We will 
encounter many such problems in the subsequent Sections. At this point we state without proof 
that, in general, for the eigenvalue problems in the confined function space, i.e., for functions 
required to obey boundary conditions (3.63, 3.64), solutions exist only for a set of discrete E 
values, the eigenvalues of the operator H. At this point we will accept that solutions <j>{f) of the 
type (3.69) exist, however, often only for a discrete set of values E n , n = 1,2, .. . We denote the 
corresponding solution by ^>e(r). We have then shown that 

^{r,t) = /(O)expf-^ 4> E {r) where H = E <f> E {r) . (3.71) 
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is a solution of the time- dependent Schrodinger equation (3.19, 3.20). 

According to (3.66) E must be real. We want to prove now that the eigenvalues E which arise in 
the eigenvalue problem (3.71) are, in fact, real. We start our proof using the property (3.28) for 
the special case that / and g in (3.28) both represent the state ^(f), i.e., 



f d\<t>* E {r)H<t> E {?) = [ d 3 r (j) E (f) H (f>* E (r) . (3.72) 
Jn Jn 

According to (3.71) this yields 



E ( d 3 rf E (r)fe(r) = E* I cPr <f> E {f) (f>* E (7) . (3.73) 
Jn Jn 

from which follows E = E* and, hence, E G R. We will show in Section 5 that E can be 
interprerted as the total energy of a stationary state. 

Stationary State of a Free Particle 

We consider now the stationary state of a free particle described by 

Hr,t) = exp^—Et) M?) , -£ V2 ^ = ( 3 - 74 ) 

The classical free particle with constant energy E > moves without bounds in the space i^oo* As 
a result we cannot postulate in the present case that wave functions are localized and normalizable. 
We will wave this assumption as we always need to do later whenever we deal with unbound 
particles, e.g. particles scattered of a potential. 

The solution <pE{r) corresponding to the eigenvalue problem posed by (3.74) is actually best labelled 
by an index k, k € 

^(r) = JVexp(ifc-f) , = E. (3.75) 

One can ascertain this statement by inserting the expression for (t>^{r) into the eigenvalue problem 
posed in (3.74) using Vexp (^ik ■ = ikexp (^ik-f^j. The resulting total energy values E are 
positive, a property which is to be expected since the energy is purely kinetic energy which, of 
course, should be positive. 
The corresponding stationary solution 

t/j(f,t) = ex ^\~J L ^^ t ) expi^ik-fj (3.76) 

has kinetic energy h 2 k 2 /2m. Obviously, one can interpret then hk as the magnitude of the momen- 
tum of the particle. The flux corresponding to (3.76) according to (3.43) is 

j(r,t) = \N\ 2 -. (3.77) 
m 

Noting that hk can be interpreted as the magnitude of the momentum of the particle the flux is 
equal to the velocity of the particle v = hk/m multiplied by |iV| 2 . 
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3.5 Particle in One-Dimensional Box 

As an example of a situation in which only bound states exist in a quantum system we consider the 
stationary states of a particle confined to a one-dimensional interval [—a, a] C R assuming that the 
potential outside of this interval is infinite. We will refer to this as a particle in a one-dimensional 
'box'. 



Setting up the Space T\ of Proper Spatial Functions 

The presence of the infinite energy wall is accounted for by restricting the spatial dependence of the 
solutions to functions f(x) defined in the domain fii = [—a, a] C R which vanish on the surface 
<9f2i = {—a, a}, i.e., 

f € = {f : [-a, a] C 1 -» I, / continuous, U(±D) = (3.78) 



Solutions of the Schrodinger Equation in T\ 

The time-dependent solutions satisfy 

ihW(x,t) = _^^( x ,t). (3.79) 

The stationary solutions have the form ip(x,t) = exp(—iEt/h)(j)E(x) where 4>e(x) is determined 
by 

h 2 d 2 

-^j^Mx) = E<j) E (x) , <j>(±a) = 0. (3.80) 

We note that the box is symmetric with respect to the origin. We can expect, hence, that the 
solutions obey this symmetry as well. We assume, therefore, two types of solutions, so-called even 
solutions obeying 4>{x) = <j){—x) 

i \ h 2 k 2 
<j)%>(x) = Acoskx , = E (3.81) 

and so-called odd solutions obeying <p(x) = —(f>(—x) 

^\ x ) = Asinfcx;, ^ = E . (3.82) 

One can readily verify that (3.81, 3.82) satisfy the differential equation in (3.80). 
The boundary conditions which according to (3.80) need to be satisfied are 

4>^°\a) = and ^°\-a) = (3.83) 

The solutions (3.81, 3.82) have the property that either both boundary conditions are satisfied or 
none. Hence, we have to consider only one boundary condition, let say the one at x = a. It turns 
out that this boundary condition can only be satisfied for a discrete set of /c-values k n ,n G N. In 
case of the even solutions (3.81) they are 

Tl 7T 
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since for such k n 

/, \ /nira\ /mr\ 

cos{k n a) = cos ^— — J = cos ^-^"J = • (3.85) 

In case of the odd solutions (3.82) only the A; n -values 

T17T 

K = 2a ' n = 2 ' 4 ' 6 '-- ^ 3 - 86 ) 
satisfy the boundary condition since for such k n 

sm(k n a) = sin (— j = sin [-) = . (3.87) 

(Note that, according to (3.86), n is assumed to be even.) 

The Energy Spectrum and Stationary State Wave Functions 

The energy values corresponding to the /c ra -values in (3.84, 3.86), according to the dispersion 
relationships given in (3.81, 3.82), are 



h 2 * 2 2 , , 

E n = , n = 1,2,3... (3.88) 



where the energies for odd (even) n-values correspond to the even (odd) solutions given in (3.81) 
and (3.82), respectively, i.e., 

</>%>(a;x) = A n cos— — , n = 1,3,5... (3.89) 

and 

4>^{a-x) = ^nsin^ , n = 2,4,6... . (3.90) 

The wave functions represent stationary states of the particle in a one-dimensional box. The wave 
functions for the five lowest energies E n are presented in Fig. (3.1). Notice that the number of 
nodes of the wave functions increase by one in going from one state to the state with the next 
higher energy E n . By counting the number of their nodes one can determine the energy ordering 
of the wave functions. 

It is desirable to normalize the wave functions such that 

f +a dx\^°\a-x)\ 2 = 1 (3.91) 

J —a 

holds. This condition implies for the even states 

\A n \ 2 I dxcos 2 ^- = \A n \ 2 a = 1, n = 1,3,5... (3.92) 



2a 

and for the odd states 



\A n \ 2 I cfasin 2 ^ = \A n \ 2 a = 1, n = 2,4,6... (3.93) 



2a 

The normalization constants are then 



A n = J- . (3.94) 
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The Stationary States form a Complete Orthonormal Basis of T\ 

We want to demonstrate now that the set of solutions (3.89, 3.90, 3.94) 

Si = {<f> n (a;x),n = 1,2,3,...} (3.95) 



where 



Ua-x) - A /I/ COS ^ forn = 1,3,5... 

K{a,x) - y a | gin ^ fo r n = 2,4,6... [6 ^> 



together with the scalar product 5 

<te/(s) 5 (x) , f,g€?i (3.97) 

-a 

form an orthonormal basis set, i.e., it holds 

(0n|0m)ni = <5«m • (3.98) 

The latter property is obviously true for n = m. In case of n ^ m we have to consider three cases, 
(i) n, m both odd, (ii) n, m both even, and (hi) the mixed case. The latter case leads to integrals 

1 f +a i n7TX mirx 
WnWm)^ = - J dxcos ^2a~ sm ^2a~ ' {S.99) 

Since in this case the integrand is a product of an even and of an odd function, i.e., the integrand 
is odd, the integral vanishes. Hence we need to consider only the first two cases. In case of n, m 
odd, n m, the integral arises 



2d 



i f^dx Icosi^ + cos (!L^ I (3.100) 

The periods of the two cos-functions in the interval [—a, a] are N, N > 1. Obviously, the integrals 
vanish. Similarly, one obtains for n, m even 



2a 

i r-+a , (n—m)nx (n+m)irx / 1n1 \ 

a La dx [ COS — 2a I COS 2a \ ( 3 - 101 ) 

and, hence, this integral vanishes, too. 

Because of the property (3.98) the elements of B\ must be linearly independent. In fact, for 

oo 

fix) = Y,dnMa;x) (3.102) 

71=1 

holds according to (3.98) 

oo 

</|/> ni = £4- ( 3 - 103 ) 

n=l 



5 We will show in Section 5 that the property of a scalar product do indeed apply. In particular, it holds: 
</l/>ni = - f{x) = 0. 
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f(x) = implies = which in turn implies d n = since d\ > 0. It follows that B\ 

defined in (3.95) is an orthonormal basis. 

We like to show finally that the basis (3.95) is also complete, i.e., any element of the function space 
T\ defined in (3.78) can be expressed as a linear combination of the elements of B\ defined in 
(3.95, 3.96). Demonstration of completeness is a formidable task. In the present case, however, 
such demonstration can be based on the theory of Fourier series. For this purpose we extend the 
definition of the elements of T\ to the whole real axis through 

f:R^R; $(W) = c3([^ + D]o - D) (3.104) 

where [y] a = ymod2a. The functions / are periodic with period 2a. Hence, they can be expanded 
in terms of a Fourier series, i.e., there exist real constants {a n , n = 0, 1, 2, . . .} and {b n , n = 1, 2, . . .} 
such that 

oo oo 

~ v-^ mrx sr^ mrx 

J = Z^ anCOS ~2a~ + l^ anSm ~2a~ (3.105) 

n=0 n=l 

The functions / corresponding to the functions in the space T\ have zeros at x = ±ma, m = 
1, 3, 5 . . . Accordingly, the coefficients a n , n = 2, 4, . . . and b n , n = 1, 3, . . . in (3.105) must vanish. 
This implies that only the trigonometric functions which are elements of B\ enter into the Fourier 
series. We have then shown that any / corresponding to elements of T\ can be expanded in terms 
of elements in B\. Restricting the expansion (3.105) to the interval [—a, a] yields then also an 
expansion for any element in T\ and B\ is a complete basis for T\. 

Evaluating the Propagator 

We can now use the expansion of any initial wave function ip(x, to) in terms of eigenfunctions 
4> n (a;x) to obtain an expression for tp(x,t) at times t > to. For this purpose we expand 



ip(x,t ) = ^ d n cf) n (a; x) . (3.106) 

n=l 

Using the orthonormality property (3.98) one obtains 

/+a 

dxo<pm{a;xo)i/j(x ,to) = d m . (3.107) 
-a 

Inserting this into (3.106) and generalizing to t > to one can write 

ij)(x,t) = ^2 cf) n (a; x) c n (t) / dx </) n (a;xo)ijj(x ,to) (3.108) 
n=i J ' a 

where the functions c n (t) are to be determined from the Schrodinger equation (3.79) requiring the 
initial condition 

Cn(to) = 1 • (3.109) 
Insertion of (3.108) into the Schrodinger equation yields 

Yl™=i < t>n{a;x)dtc n (t) dx (f>n{a; x ) ip(x , to) = 

J2n=i {-k E n) 4> n {a; x) c n (t) dx (j) n (a; x ) ip{x ,t ) . (3.110) 
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Multiplying both sides by (j) m {a;x) and integrating over [—a, a] yields, according to (3.98), 

d t c m (t) = --E m c m (t), Cm(to) = 1. (3.111) 
The solutions of these equations which satisfy (3.109) are 

c m {t) = ex V (^- % -E m (t - t )) • (3.112) 

Equations (3.108, 3.112) determine now i[){x,t) for any initial condition i/j(x,to). This solution can 
be written 

/+a 
dx 4>(x,t\xo,t )ip(x ,to) (3.113) 
-a 

where 

4>(x,t\x ,t ) = ^2<p n (a;x)exp ( --E n (t - t ) J (f> n (a;x ) . (3.114) 

n=l ^ ' 

This expression has the same form as postulated in the path integral formulation of Quantum 
Mechanics introduced above, i.e., in (2.5). We have identified then with (3.114) the representation 
of the propagator for a particle in a box with infinite walls. 

It is of interest to note that (f)(x, t\xo, to) itself is a solution of the time-dependent Schrodinger 
equation (3.79) which lies in the proper function space (3.78). The respective initial condition 
is (j)(x, to\xo, to) = 5(x — xo) as can be readily verified using (3.113). Often the propagator 
(f)(x,t\xo,to) is also referred to as a Greens function. In the present system which is composed 
solely of bound states the propagator is given by a sum, rather than by an integral (3.54) as in the 
case of the free particle system which does not exhibit any bound states. 

Note that the propagator has been evaluated in terms of elements of a particular function space T\ , 
the elements of which satisfy the appropriate boundary conditions. In case that different boundary 
conditions hold the propagator will be different as well. 



Example of a Non-Stationary State 

As an illustration of a non-stationary state we consider a particle in an initial state 



ip(x ,to) 



1 



2ira 2 



exp 



~ + ik x 



, k 



15 

a 



(3.115) 



This initial state corresponds to the particle being localized initially near xo = with a velocity 
vo = \bh/ma in the direction of the positive x-axis. Figure 3.2 presents the probability distribution 
of the particle at subsequent times. One can recognize that the particle moves first to the left and 
that near the right wall of the box interference effects develop. The particle moves then to the 
left, being reflected at the right wall. The interference pattern begins to 'smear out' first, but the 
collision with the left wall leads to new interference effects. The last frame shows the wave front 
reaching again the right wall and the onset of new interference. 
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Figure 3.2: Stroboscopic views of the probability distribution \i/j(x, t)\ for a particle in a box 
starting in a Gaussian distribution with momentum 15h/a. 
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Summary: Particle in One-Dimensional Box 

We like to summarize our description of the particle in the one-dimensional box from a point of 
view which will be elaborated further in Section 5. The description employed a space of functions 
T\ defined in (3.78). A complete basis of T\ is given by the infinite set B\ (3.95). An important 
property of this basis and, hence, of the space T\ is that the elements of the basis set can be 
enumerated by integer numbers, i.e., can be counted. We have defined in the space T\ a scalar 
product (3.97) with respect to which the eigenfunctions are orthonormal. This property allowed us 
to evaluate the propagator (3.114) in terms of which the solutions for all initial conditions can be 
expressed. 



3.6 Particle in Three-Dimensional Box 

We consider now a particle moving in a three-dimensional rectangular box with side lengths 
2ai, 2a 2 , 2a3. Placing the origin at the center and aligning the x\, x 2 , X3~axes with the edges 
of the box yields spatial boundary conditions which are obeyed by the elements of the function 
space 

T 3 = {/ : — ► R, / continuous, I5(>r>^,^>£,^j^) = Y- V (-/^,^v,^£) T G d^} . (3.116) 

where O is the interior of the box and d£l its surface 

= [— oi, oi] <8> [— a 2 , a 2 ] <8> [— a 3 , a 3 ] C R F 
dfl = {(xi,x 2 ,x 3 ) T G xi = ±a\} U {(xi, x 2 , x 3 ) T G x 2 = ±a 2 } 

U {(x!,X2, x 3 ) T £ft,x 3 = ±a 3 } . (3.117) 

We seek then solutions of the time-dependent Schrodinger equation 

h 2 

ihd t ip(x,t) = Hil>(xi,x 2 ,x 3 ,t) , H = -— (d 2 +dl + dl) (3.118) 
which are stationary states. The corresponding solutions have the form 



il>(x 1 ,X2,x s ,t) = exp ^--Etj <p E (xi,x 2 ,x 3 ) (3.119) 

where 4>e(xi, x 2 , x 3 ) is an element of the function space T 3 defined in (3.116) and obeys the partial 
differential equation 

H 4>e(xi,x 2 ,x 3 ) = E(f) E (xi,X2,x 3 ) . (3.120) 

Since the Hamiltonian H is a sum of operators 0(xj) each dependent only on a single variable, i.e., 
H = 0(xi) + 0(x 2 ) + 0(^3), one can express 

3 

(f>( Xl ,X2,x 3 ) = Y[<t>V\xj) (3.121) 

3=1 

where 

-^d 2 ^\ Xj ) = EjftXxj) , <f>W(± aj ) = 0, j = 1,2,3 (3.122) 
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and Ei + E 2 + E 3 = E. Comparing (3.122) with (3.80) shows that the solutions of (3.122) are 
given by (3.96) and, hence, the solutions of (3.120) can be written 



( ni ,n 2 ,n s ){ai,a 2 ,a 3 ;xi,x 2 ,x 3 ) = 4> ni {ai; xt) (p n2 {a 2 ; x 2 ) 4>n 3 {a 3 ;x 3 ) 
ni,n 2 ,n 3 = 1,2,3, .. . 



and 



-^(rii,n 2 ,n3) 



h 2 TT 2 



n 



Tin 



i 2 + ~f + -f ) ,ni,n 2 ,n 3 = 1,2,3,. 
8m \ af a§ 



(3.123) 
(3.124) 



The same considerations as in the one-dimensional case allow one to show that 

#3 = {(f>( ni!n2 ^ 3 )(a 1 ,a 2 ,a 3 -,xi,x 2 ,x 3 ) , ni,n 2 ,n 3 = 1,2,3,...} (3.125) 
is a complete orthonormal basis of T 3 and that the propagator for the three-dimensional box is 



(r,t\f ,t ) 



< ^(™i,'"'j 

ni,n2,n3=l 



exp 



in3 )(ai,a2,a 3 ;r) 

E (nm 2 n 3 ) (* ~ *o) ) 0(ni,n 2 ,n 3 )( a l' a 2' a 3! ^b) • 



(3.126) 



Symmetries 

The three-dimensional box confining a particle allows three symmetry operations that leave the box 
unchanged, namely rotation by ir around the x\,x 2 ,x 3 -&x.es. This symmetry has been exploited 
in deriving the stationary states. If two or all three orthogonal sides of the box have the same 
length further symmetry operations leave the system unaltered. For example, if all three lengths 
^l, 02,03 are identical, i.e., a± = a 2 = a 3 = a then rotation around the xi,x 2 ,x 3 -axes by tt/2 
also leaves the system unaltered. This additional symmetry is also reflected by degeneracies in the 
energy levels. The energies and corresponding degeneracies of the particle in the three-dimensional 
box with all side lengths equal to 2a are given in the following Table: 



m 


n 2 


n 3 


E/[ti 2 ir 2 /8ma 2 } 


degeneracy 


1 


1 


1 


3 


single 


1 


1 


2 


6 


three-fold 


1 


2 


2 


9 


three-fold 


1 


1 


3 


11 


three-fold 


2 


2 


2 


12 


single 


1 


2 


3 


14 


six-fold 


2 


2 


3 


17 


three-fold 


1 


1 


4 


18 


three-fold 


2 


3 


3 


22 


three-fold 


3 


3 


3 


27 


single 


1 


1 


5 


27 


three-fold 



One can readily verify that the symmetry of the box leads to three- fold and six- fold degeneracies. 
Such degeneracies are always a signature of an underlying symmetry. Actually, in the present case 
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'accidental' degeneracies also occur, e.g., for (m, ri2,nz) = (3,3,3) (1, 1,5) as shown in the Table 
above. The origin of this degeneracy is, however, the identity 3 2 + 3 2 + 3 2 = l 2 + l 2 + 5 2 . 
One particular aspect of the degeneracies illustrated in the Table above is worth mentioning. We 
consider the degeneracy of the energy £'122 which is due to the identity £122 = -E212 = £221- Any 
linear combination of wave functions 

4>{r) = a 0(i,2,2) (a, a, a; r) + (3 0(2,1,2)0, «> a; r) + 7 0(2,2,1) (a, a, a; r) (3.127) 

obeys the stationary Schrodinger equation H 0(r) = E122 <t>{f)- However, this linear combination 
is not necessarily orthogonal to other degeneraste states, for example, 0(i,2,2)( a > a > a i 0- Hence, 
in case of degenerate states one cannot necessarily expect that stationary states are orthogonal. 
However, in case of an n-fold degeneracy it is always possible, due to the hermitian character of 
H, to construct n orthogonal stationary states 6 . 



This is a result of linear algebra which the reader may find in a respective textbook. 
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Chapter 4 

Linear Harmonic Oscillator 



The linear harmonic oscillator is described by the Schrodinger equation 

ihd t ip(x,t) = Hip(x,t) (4.1) 

for the Hamiltonian 

^ = -£S + r- v ' < 42 > 

It comprises one of the most important examples of elementary Quantum Mechanics. There are sev- 
eral reasons for its pivotal role. The linear harmonic oscillator describes vibrations in molecules and 
their counterparts in solids, the phonons. Many more physical systems can, at least approximately, 
be described in terms of linear harmonic oscillator models. However, the most eminent role of this 
oscillator is its linkage to the boson, one of the conceptual building blocks of microscopic physics. 
For example, bosons describe the modes of the electromagnetic field, providing the basis for its 
quantization. The linear harmonic oscillator, even though it may represent rather non-elementary 
objects like a solid and a molecule, provides a window into the most elementary structure of the 
physical world. The most likely reason for this connection with fundamental properties of matter 
is that the harmonic oscillator Hamiltonian (4.2) is symmetric in momentum and position, both 
operators appearing as quadratic terms. 

We have encountered the harmonic oscillator already in Sect. 2 where we determined, in the context 
of a path integral approach, its propagator, the motion of coherent states, and its stationary states. 
In the present section we approach the harmonic oscillator in the framework of the Schrodinger 
equation. The important role of the harmonic oscillator certainly justifies an approach from two 
perspectives, i.e., from the path integral (propagotor) perspective and from the Schrodinger equa- 
tion perspective. The path integral approach gave us a direct route to study time-dependent 
properties, the Schrodinger equation approach is suited particularly for stationary state properties. 
Both approaches, however, yield the same stationary states and the same propagator, as we will 
demonstrate below. 

The Schrodinger equation approach will allow us to emphasize the algebraic aspects of quantum 
theory. This Section will be the first in which an algebraic formulation will assume center stage. 
In this respect the material presented provides an important introduction to later Sections using 
Lie algebra methods to describe more elementary physical systems. Due to the pedagodical nature 
of this Section we will link carefully the algebraic treatment with the differential equation methods 
used so far in studying the Schrodinger equation description of quantum systems. 
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In the following we consider first the stationary states of the linear harmonic oscillator and later 
consider the propagator which describes the time evolution of any initial state. The stationary states 
of the harmonic oscillator have been considered already in Chapter 2 where the corresponding wave 
functions (2.235) had been determined. In the framework of the Schrodinger equation the stationary 
states are solutions of (4.1) of the form ip(x,t) = exp(—iEt/h)4>E(x,t) where 

H4> E {x) = E 4>e(x) . (4.3) 

Due to the nature of the harmonic potential which does not allow a particle with finite energy to 
move to arbitrary large distances, all stationary states of the harmonic oscillator must be bound 
states and, therefore, the natural boundary conditions apply 

lim <f> E (x) = . (4.4) 

Equation (4.3) can be solved for any E £ R, however, only for a discrete set of E values can the 
boundary conditions (4.4) be satisfied. In the following algebraic solution of (4.3) we restrict the 
Hamiltonian H and the operators appearing in the Hamiltonian from the outset to the space of 
functions 

M = {/ : R ->■ R, U € Coo, lim U{^) = (4.5) 

where denotes the set of functions which together with all of their derivatives are continuous. 
It is important to keep in mind this restriction of the space, in which the operators used below, 
act. We will point out explicitly where assumptions are made which built on this restriction. If 
this restriction would not apply and all functions / : R — > R would be admitted, the spectrum of 
H in (4.3) would be continuous and the eigenfunctions <Pe(x) would not be normalizable. 



4.1 Creation and Annihilation Operators 

The Hamiltonian operator (4.2) can be expressed in terms of the two operators 

h d 

p = —— , x = x (4.6) 
i dx 

the first being a differential operator and the second a multiplicative operator. The operators act 
on the space of functions M\ defined in (4.5). The Hamiltonian H can be expressed in terms of the 
operators acting on the space (4.5) 

H = —p 2 + -muj 2 x 2 (4.7) 
2m 2 

which is why these operators are of interest to us. 

The cardinal property exploited below, beside the representation (4.7) of the Hamiltonian, is the 
commutation property 

[p, x] = * 1 (4.8) 

which holds for any position and momentum operator. This property states that p and x obey an 
algebra in which the two do not commute, however, the commutator has a simple form. In order 
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to derive (4.8) we recall that all operators act on functions in N\ and, hence, the action of [p, x] 
on such functions must be considered. One obtains 

From this follows (4.8). 

Our next step is an attempt to factorize the Hamiltonian (4.7) assuming that the factors are easier 
to handle than the Hamiltonian in yielding spectrum and eigenstates. Being guided by the identity 
for scalar numbers 

(b-ic)(b + ic) = b 2 - i{cb-bc) + c 2 = b 2 + c 2 (4.10) 

we define 



— - x — p , a = \ — —x + p . (4.11 

The reader may note that we have attempted, in fact, to factor H/Hlj. Since d + and d~ are 
operators and not scalars we cannot simply expect that the identy (4.10) holds for d + and a~ since 
[a - , a + ] = a~a + — a + a~ does not necessarily vanish. In fact, the commutator property (4.8) 
implies 

[a~,a + ] = 11 . (4.12) 
To prove this commutation property we determine using (4.11) 

„ mu> „ 9 1 .9 i r „ / . x 

(4.7) and (4.8) yield 

a-a + = —H + - E . (4.14) 
raj 2 

Similarly one can show 

a + a~ = —H - - 1 . (4.15) 
&j 2 v 7 

(4.14) and (4.15) together lead to the commutation property (4.12). 

Before we continue we like to write (4.14, 4.15) in a form which will be useful below 

H = huja-a + - (4.16) 

H = hwa + a- + ^E . (4.17) 
We also express and a directly in terms of the coordinate x and the differential operator ^ 



mu h d „_ mu! „ h d , . 

^r-v^^' a ~ = + V^^- (4 - 18) 

It is of interest to note that the operators a + and d~ are real differential operators. 
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Relationship between <r and a 

The operators a + and a~ are related to each other by the following property which holds for all 
functions /, g € Mi 

/+oo r+oc 
dxf{x)a + g{x) = / dxg(x)a-f(x) . (4.19) 
-oo J — oo 

This property states that the operators a + and a~ are the adjoints of each other. The property 
follows directly from (??). Using (??) we like to state (4.19) in the form 1 



{f\a + 9)^ = (Qla-f)^ ■ (4.20) 

In the following we will determine the spectrum of H and its eigenstates. The derivation will be 
based solely on the properties (4.12, 4.16, 4.17, 4.19). 

a + and a as Ladder Operators 

The operators a + and a~ allow one to generate all stationary states of a harmonic oscillator once 
one such state 4>e(x) 

H<f> E (x) = EM*) (4-21) 
is available. In fact, one obtains using (4.16, 4.17, 4.21) 

H a~ 4>e{x) = (hujd~d + — — IL ) a - (/)e(x) 

= ar{hjja + aT + — 11 — Huj IL ) 4>e{x) 
= a~ {H — huj)<pE( x ) 

= (E - hw)a~ (f> E (x) . (4.22) 
Similarly one can show using again (4.16, 4.17, 4.21) 

H a + 4>e{x) = {fyjja + aT + — IL ) a + 4>e{x) 

= a + {hjja~a + — — II + Huj IL ) 4>e{x) 
= a + (H + hLo)4> E {x) 

= (E + hw)d + <j) E (x) . (4.23) 

Together, it holds that for a stationary state <Pe{x) of energy E defined through (4.21) a~4>E(x) is 
a stationary state of energy E — htu and a + 4>E(x) is a stationary state of energy E + htu. 
The results (4.22, 4.23) can be generalized to m-fold application of the operators a + and a~ 

H (a+) m <f> E {x) = (E + mhu) (a+) m <j> E {x) 

H (a~) m 4> E {x) = (E - mhuj) (a~) m (f> E {x) . (4.24) 



J This property states that the operators in the function space A/"i are the hermitian conjugate of each other. This 
property of operators is investigated more systematically in Section 5. 
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One can use these relationships to construct an infinite number of stationary states by stepping up 
and down the spectrum in steps of ±hu. For obvious reasons one refers to a + and d~ as ladder 
operators. Another name is creation (a + ) and annihilation (a - ) operators since these operators 
create and annihilate vibrational quanta huj. There arise, however, two difficulties: (i) one needs to 
know at least one stationary state to start the construction; (ii) the construction appears to yield 
energy eigenvalues without lower bounds when, in fact, one expects that E = should be a lower 
bound. It turns out that both difficulties can be resolved together. In fact, a state 4>o(x) which 
obeys the property 

a~M x ) = (4.25) 

on one side would lead to termination of the sequence Eq + m, m £ Z when m is decreased, on 
the other side such a state is itself an eigenstate of H as can be shown using (4.17) 

Hcf) (x) = {huoa + a- + ^-TL)M X ) = ^Hlj (j) (x) . (4.26) 

Of course, the solution 4>o(x) of (4.25) needs to be normalizable in order to represent a bound state 
of the harmonic oscillator, i.e., ^o(^) should be an element of the function space Afi defined in 
(4.5). 

The property (??) has an important consequence for the stationary states 4>e(x). Let 4>e(x) and 
4>E'{x) be two normalized stationary states corresponding to two different energies E, E' , E ^ E' . 
For f(x) = 4>e(x) and g(x) = <j>E'(x) in (??) follows (Note that according to (??) the eigenvalue 
E is real.) 

= {MHMn^ ~ (^eWe)^ = (E' - E){<j> E \<i> E ,) noo . (4.27) 
Since E ^ E' one can conclude 

(^l^)nco = • (4.28) 
4.2 Ground State of the Harmonic Oscillator 

A suitable solution of (4.25) can, in fact, be found. Using (4.6, 4.11) one can rewrite (4.25) 

d 



where 



(hj + y ) My) = o (4.29) 



Assuming that </>o(y) does not vanish anywhere in its domain ] — oo, +oo[ one can write (4.29) 

1 d My) == ^My) = -v, (4.3i) 



My) d y dy 

the solution of which is 



for some constant cq or 



inMv) = -\y 2 + co (4.32) 



o»(jj) - r cxp ( - X -y 2 ) . 
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This solution is obviously normalizable. The conventional normalization condition 

(<fo|<£o)ax, = 1 

reads 



/+oo 
dx \<f) (x 
-oo 

The appropriate ground state is 



2 _ I |2 

— l c o| 



J — c 



dxexp 



mujx 



</>oO) 



mw 



exp 



2fr 



mux 



l c o| 



7r/l 



2fr 



(4.34) 
(4.35) 

(4.36) 



Since the first order differential equation (4.25) admits only one solution there is only one set of 
states with energy E + mhuj, m £ Z which properly terminate at some minimum value E + 
mohu > 0. We recall that according to (4.26) the energy value associated with this state is \hio. 
This state of lowest energy is called the ground state of the oscillator. The set of allowed energies 
of the oscillator according to (4.24) can then be written as follows 



E n = hw{n + -), 



n = 0, 1, 2, ... 



(4.37) 



It is most remarkable that the energy ^fkv of the ground state is larger than the lowest classically 
allowed energy E = 0. The reason is that in the Hamiltonian (4.2) there are two competing 
contributions to the energy, the potential energy contribution which for a state 5(x), i.e., a state 
confined to the minimum corresponding to the classical state of lowest energy, would yield a van- 
ishing contribution, and the kinetic energy contribution, which for a narrowly localized state yields 
a large positive value. The ground state (4.36) assumes a functional form such that both terms 
together assume a minimum value. We will consider this point more systematically in Section 5. 



4.3 Excited States of the Harmonic Oscillator 

Having obtained a suitable stationary state with lowest energy, we can now construct the stationary 
states corresponding to energies (4.37) above the ground state energy, i.e., we construct the states 
for n = 1,2,..., the so-called excited states. For this purpose we apply the operator d + to the 
ground state (4.36) n times. Such states need to be suitably normalized for which purpose we 
introduce a normalization constant c' n 

4> n {x) = c' n {a + ) n 0o(x) . n = 0, 1, 2, . . . (4.38) 

These states correspond to the energy eigenvalues (4.39), i.e., it holds 

HMx) = hu(n + ^ n = 0, 1, 2, ... (4.39) 

We notice that the ground state wave function (f>o(x) as well as the operators (a + ) n are real. We 
can, therefore, choose the normalization constants c' n and the functions (j) n {x) real as well. 
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We need to determine now the normalization constants c' n . To determine these constants we adopt 
a recursion scheme. For n = holds c' = 1. We consider then the situation that we have obtained 
a properly normalized state 4> n (x). A properly normalized state <f> n +i(x) is then of the form 

4> n +i{x) = a n a + (j) n (x) (4.40) 

for some real constant a n which is chosen to satisfy 

(0n+l|0TH-l)fioo = a n (a +( ? !, n|a + ^n)Q oci = 1- (4.41) 

Employing the adjoint property (4.20) yields 

a\ (cP n \a-a + <p n ) noc = 1 • (4.42) 

Using (4.14) together with H 4> n (x) = huj(n+^) 4> n (x) leads to the condition (note that we assumed 
4>n(x) to be normalized) 

a 2 n (n + \ ){<j) n \a-a + (l) n ) Qoo = a 2 n (n + 1) = 1 (4.43) 
From this follows a n = 1/y/n + 1 and, according to (4.40), 

a + (f) n (x) = Vn + l(j) n+1 (x) . (4.44) 
One can conclude then that the stationary states of the oscillator are described by the functions 

4> n {x) = -L= (a+) n 4> {x) , n = 0,1,2,... (4.45) 
\/n\ 



We like to note that these functions according to (4.28) and the construction (4.40-4.45) form an 
orthonormal set, i.e., they obey 

{(f>n\<t>n>) = <W , n,ri = 0,1,2,... . (4.46) 

According to (4.24) holds in analogy to (4.40) 

<f>„-i{x) = I3 n a~ <p n {x) (4.47) 

for some suitable constants (3 n . Since d~ is a real differential operator [see (4.18)] and since the 
4>n(x) are real functions, [3 n must be real as well. To determine f3 n we note using (4.20) 

1 = {(^n-Mn-i)^ = Pl(a~(t>n\a"(t>n}n x = fil (4>n\a + cT 4>n}n x ■ (4.48) 
Equations (4.15 , 4.39) yield 

1 = Pln^n)^ = fan. (4.49) 
From this follows (3 n = l/^/n and, according to (4.47), 

aT (f) n (x) = y/n~(f) n -i(x) . (4.50) 

Repeated application of this relationship yields 



<Pns(x) = V^r^ ( a ") s Mx) ■ (4.5i) 
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Evaluating the Stationary States 

We want to derive now an analytical expression for the stationary state wave functions 4> n {x) 
denned through (4.39). For this purpose we start from expression (4.45), simplifying the calculation, 
however, by introducing the variable y defined in (4.30) and employing the normalization 



f 



dy 4>n(y) 



1 



(4.52) 



This normalization of the wave functions differs from that postulated in (4.35) by a constant, 
n-independent factor, namely the square root of the Jacobian dx/dy, i.e., by 



/ dx 




[muj- 


1 dy 







(4.53) 



We will later re-introduce this factor to account for the proper normalization (4.35) rather than 
(4.52). 

In terms of y the ground state wave function is 



Mv) 



7T 4 e 2 



and d + is 



1 
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y - 



dy) 



The eigenstates of the Hamiltonian are then given by 

1 2 2 

1 y y 



<Pn(y) 



y/2 n n\ y/lf 



e 2 e 2 



dy 



e 2 



(4.54) 
(4.55) 

(4.56) 



Relationship to Hermite Polynomials 

We want to demonstrate now that the expression (4.56) can be expressed in terms of Hermite 
polynomials H n (y) introduced in Sect. 2.7 and given, for example, by the Rodrigues formula (2.200). 
We will demonstrate below the identity 



H n {y) = 




(4.57) 



such that one can write the stationary state wave functions of the harmonic oscillator 

My) = / -^-= e- y 4 Hn (y). (4.58) 

This result agrees with the expression (2.233) derived in Sect. 2.7. It should be noted that the 
normalization (4.28) of the ground state and the definition (4.57) which includes the factor \j\fn\ 
according to Eqs. (4.40-4.46) yields a set of normalized states. 
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To verify the realtionship between the definition (4.57) of the hermite polynomials and the definition 
given by (2.200) we need to verify 

y-jVe-T = (-l)' l e^e-» 2 . (4.59) 
dy) K ' dy n K ' 



which implies 

H n (y) = (-Ife^e-f 2 (4.60) 
and, hence, the Rodrigues formula (2.200). 

We prove (4.59) by induction noting first that (4.59) holds for n = 0, 1, and showing then that the 
property also holds for n + 1 in case it holds for n, i.e., 

9(v) = i-^e^^e-y* = {y-l y ) ^ ■ ( 4 - 61 ) 

One can factor g(y) and employ (4.59) as follows 

a 2 d _v 2 _ i? d n _ 2 

q(v) = -e^—e "2" (-l) n - — e y 

1? d ( d Y l _it 

= -e 2 — e 2 y_ e 2 . (4.62) 



V dy 

Denoting f(y) = (y — d/dy)exp(— y 2 /2) and employing 

^e"^/(y) = -ye-^fiy) + ^^f(y) (4.63) 

one obtains 

</(*/) = (y - ^) /(y) (4.64) 

which implies that (4.61) and, therefore, (4.59) hold. 



4.4 Propagator for the Harmonic Oscillator 

We consider now the solution of the time-dependent Schrodinger equation of the harmonic oscillator 
(4.1, 4.2) for an arbitrary initial wave function ip(xo,to). Our derivation will follow closely the 
procedure adopted for the case of a 'particle in a box' [see Eqs. (3.106-3.114)]. For the sake of 
notational simplicity we employ initially the coordinate y as defined in (4.30) and return later to 
the coordinate x. 

Starting point of our derivation is the assumption that the initial condition can be expanded in 
terms of the eigenstates 4> n (y) (4.39) 

oo 

ip(yo,to) = ^2 d n 4> n (y ) . (4.65) 

?1=0 
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Such expansion is possible for any f(yo) = ip(yo,to) which is an element of Mi defined in (4.5), 
a supposition which is not proven here 2 . Employing orthogonality condition (4.46) the expansion 
coefficients d n are 

/+oo 
dyo ip(yo, t ) (f> n (yo) ■ (4.66) 
-oo 

One can extend expansion (4.65) to times t > to through insertion of time-dependent coefficients 

Cn(h) 

oo 

*l>(V,h) = d n<t>n(y)Cn(t) (4.67) 

n=0 

for which according to (3.108-3.112) and (4.39) holds 

c n (ti) = exp(^-ioj{n + |) (ti - t )) • (4.68) 
Altogether one can express then the solution 

/+oo 

dyo(f>(y,t\yo,t )i(>(yo,t) (4.69) 
-oo 

where 

oo 

0(y,ti|y o ,to) = ^2Mv)Mvo)t n+ 3 , t = e'^'^ (4.70) 

n=0 

is the propagator of the linear harmonic oscillator. We want to demonstrate now that this propaga- 
tor is identical to the propagator (2.147) for the harmonic iscillator determined in Sect. sec:harm. 
In order to prove the equivalence of (4.70) and (2.147) we employ the technique of generating 
functions as in Sect. 2.7. For this purpose we start from the integral representation (2.225) which 
allows one to derive a generating function for products of Hermite polynomials which can be applied 
to the r.h.s. of (4.70). We consider for this purpose the following expression for \t\ < 1 

< A V* H n {y)e-y 2 l 2 H n {yo)e-yV\ n 
w(y,yo,t) = ^ — j= 1 . (4.71) 

n=0 v 

Applying (??) to express H n (y) and H n (y ) yields 

exp(— 2tuv) 



/„2 ,.2\ p+oo f+co 00 -i 

w(y, y ,t) = exp^ + ljjf du J dv J2^( ~ 2tuv T x 

x exp( -u 2 - v 2 + 2iyu + 2iy v ) . (4.72) 

2 A demonstration of this property can be found in Special Functions and their Applications by N.N.Lebedev 
(Prentice Hall, Inc., Englewood Cliffs, N.J., 1965) Sect. 4.15, pp. 68; this is an excellent textbook from which we 
have borrowed heavily in this Section. 
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Carrying out the sum on the r.h.s. one obtains 



w(y,yo,t) = 7r- 3 / 2 exp ( V — + ^ J dv exp( -v z + 2iy v ) x 



»+oo 



/+oo 
duexp( — u 2 — 2u(tv — iy)) . 
-co 



(4.73) 



■ v /7rexp( t 2 v 2 — 2iytv — y 2 ) 



The incomplete Gaussian integral 

f + oo 

dx e 



f 



-«V-2fa = V^^/a 2 R e a 2 > 



(4.74) 



applied once results in 

/ 2 2\ /-+00 

w(y,yo,t) = 7r lex P( _ y + ^J y dt>exp(-(l - t 2 ) t> 2 - 2i (yi - y ) f) • 
Applying (4.74) a second time yields finally together with the definition (4.71) of w(y,yo,t) 



(4.75) 



^(i-t 2 ) 



exp 



v-oo H n (y)e-y 2 / 2 H n (y )e-yp 2 n 



2 n n! Vtt 

One can express this in terms of the stationary states (4.58) of the harmonic oscillator 

°° - r , , . .2 



(4.76) 



! 1 1 + t t 

Y,My)Mvo)t n = exp - ( y 2 + y 2 )^-^ + 2 yy 0r — 2 ■ ( 4 - 77 ) 



The sum in (4.70) is, indeed, identical to the generating function (4.77), i.e., it holds 



4>{y,h\yo,to) = 



y/2i 7r F(ti, to) 



exp 



{v 2 + vl) G(ti,t Q ) - iyy 



1 



F(h,t ) 



where 



and 



F(h,t ) 



I _ e -2iu>(t 1 -t ) 
2i e -iu{ti-t ) 



smuj{ti — to) 



G(h,t ) = i 



1 + e -M«(ti-to) _ cOS^(ti - t ) 



(4.78) 



(4.79) 



(4.80) 



1 - e - 2iw (*i-*o) sino;(ti - t ) 

We can finally express the propagator (4.78) in terms of the coordinates x and Xq. This requires 
that we employ (4.30) to replace y and yo and that we multiply the propagator by y^muj/h, i.e., 
by a factor y/muj/h for both <f> n (y) and <ft n {yo)- The resulting propagator is 



(a;,£|xo,to) 



muj 



6XP j 2^sinL^-to) [ + ^ COS " ( ' 1 " t0) ~ 2 " X ° ] | • (481) 
This result agrees with the propagator (2.147) derived by means of the path integral description. 



2iirhsmuj(ti — to) 

[ (x 2 + Xq) cosw(ti — to) — 2 x xo ] 
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4.5 Working with Ladder Operators 

In the last section we have demonstrated the use of differential equation techniques, the use of 
generating functions. We want to introduce now techniques based on the ladder operators a + and 
d~. For the present there is actually no pressing need to apply such techniques since the techniques 
borrowed from the theory of differential equations serve us well in describing harmonic oscillator 
type quantum systems. The reason for introducing the calculus of the operators a + and a~ is that 
this calculus proves to be useful for the description of vibrations in crystals, i.e., phonons, and of 
the modes of the quantized electromagnetic field; both quantum systems are endowed with a large 
number of modes, each corresponding to a single harmonic oscillator of the type studied presently 
by us. It is with quantum electrodynamics and solid state physics in mind that we cease the 
opportunity of the single quantum mechanical harmonic oscillator to develop a working knowledge 
for a + and a~ in the most simple setting. 

To put the following material in the proper modest perspective we may phrase it as an approach 
which rather than employing the coordinate y and the differential operator d/dy uses the operators 

Obviously, one can express y = y/2(a~ + a + ) and d/dy = \/2{a~ — a + ) and, hence, the approaches 
using y, d/dy and a + , a~ must be equivalent. 

Calculus of Creation and Annihilation Operators 

We summarize first the key properties of the operators a + and a~ 

[a~,a + ] = 11 

a~ My) = 

a + 4>n(y) = Vn + i 4> n +i{y) 

a~ <j) n {y) = y/ri(f> n -i(y) 
(a-f\g)n x =(f\& + g)n oc . (4.83) 

We note that these properties imply 

My) = (a+)> (y)/v^! . (4.84) 

We will encounter below functions of a + and a - , e.g., /(a + ). Such functions are defined for 
/ : R — > R in case that the Taylor expansion 

is convergent everywhere in R. Here f^\yo) denotes the v-th derivative of f(y) taken at y = yo. 
In this case we define 

/(a+) = £&). {&+ y (4 . 86) 

!/=0 
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and similarly for /(a ). The following important property holds 

a-/(a+) = /M(a + ) + /(a + )a- . (4.87) 

In particular, 

a-f(a + )My) = f {1) (a + )My) (4-88) 
which follows from a" 4> (y) = 0. We note 

o-/(o+) = [a-,/(a+)] + /(a + )a- (4.89) 

which implies that in order to prove (4.87, 4.88) we need to show actually 

,/(&+)} = / (1) (a+). (4.90) 

To prove (4.90) we show that (4.90) holds for any function f n (& + ) = (a + ) n which is a power of 
a + . The convergence of the Taylor expansion ascertains then that (4.90) holds for f(a + ). 

We proceed by induction noticing first that (4.90) holds for /o and for f\. The first case is trivial, 
the second case follows from 

[a-,h(a + )] = [a~,a + ] = 11 = f[ 1 \a+) . (4.91) 

Let us assume that (4.90) holds for f n . For f n +i follows then 

[a-,f n+1 (a+)\ = [a-,(a + ) n a + ] = (a + ) n [a~,a + ] + [a-,(a + ) n ]a + 

= (a + ) n l + n(a+) n - 1 a+ = (n + 1) (a+) n (4.92) 

Since any function f(y) in the proper function space A/i can be expanded 

f( y ) = Y, d nMy) = T.^-^My) (4-93) 



n=0 n=0 v n - 



one can reduce all operators acting on some proper state function by operators acting on the state 
4>o(y). Hence, property (4.88) is a fundamental one and will be used in the following, i.e., we will 
only assume operator functions acting on 4>o(y). As long as the operators act on 4>o(y) one can 
state then that a~ behaves like a differential operator with respect to functions /(a + ). Note that 
an immediate consequence of (4.88) is 

(a~) n f(a + ) My) = f (n) (a + ) My) ■ (4-94) 

We like to state the following property of the functions /(cr* 1 ) 

{f{a~)m = W(a + )V> • (4.95) 

This identity follows from (4.83) and can be proven for all powers f n by induction and then inferred 
for all proper functions /(a^). 

An important operator function is the exponential function. An example is the so-called shift 
operator exp(ua~). It holds 

e u& ~ /(a+) My) = /(« + + «) My) ■ (4-96) 
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To prove this property we expand exp(ua ) 

CO „ CO v 

^ (&-r /(« + ) My) = E li My) = + n) My) ■ (4 - 97) 

i/=0 ' u=0 

An example in which (4.96) is applied is 

e ud -e vd+ Mv) = e v ^ ++u Uo(y) = e uv e™ + My) • (4-98) 

The related operators exp[ua + ± v*a~] play also an important role. We assume in the following 
derivation, without explicitly stating this, that the operators considered act on <j>o(y). To express 
these operators as products of operators exp(va + ) and exp(v*a~) we consider the operator C{u) = 
exp(uza + ) exp(uz*a~) where u G R, z G C, FF* = ^ and determine its derivative 

^ = za + e M2a+ e uz * a ~ + e uz&+ z*a~ e uz * & ~ = 

(za + + z*cT) e uz&+ e uz * & ~ - \z*a-,e uz&+ ] e uz *»~ (4.99) 

Using (4.90) and zz* = 1 we can write this 

£C(u) = {za + + z*aT) e u ™ + e uz *"~ - ue uzd+ e uz **~ = 

(za + + z*a- - u) C{u) . (4.100) 

To solve this differential equation we define C(u) = D(u) exp(— u 2 /2) which leads to 

d 



du 



D(u) = (za + + z*a~) D{u) (4.101) 



C(u) obviously obeys C(0) = E. This results in D(0) = H and, hence, the solution of (4.101) 
is D(u) = exp(uza + + uz*d~). One can conclude then using the definition of C(u) and defining 

V = uz 

e v&+ + v * a ~My) = e*™* e v&+ e v ' & ~ My) ■ ( 4 - 102 ) 



Similarly, one obtains 



My) = e~* vv e va e~ va My) (4-103) 

e v&+ + v * & ~My) = e~^ vv * e v * & ~ e v&+ My) ( 4 -104) 

e v&+ - v *' a ~ My) = e^ vv * e~ v * & ~ e vh+ Mv) ■ (4-105) 
Below we will use the operator identity which follows for the choice of v = a in (4.105) 

e aa+- a *a- My) = ^ e -a*a" g aa+ _ (4 1Q6) 

Applying (4.98) for u = —a* and u = a yields 

e -a*a- e «a+^ (y) = e -«* g aa+ _ (4 10?) 

This together with (4.106) yields 

0o(2/) = e"— e aa 0o(y) • (4-108) 



4.5: Working with Creation and Annihilation Operators 



87 



Generating Function in Terms of a + 

We want to demonstrate now that generating functions are as useful a tool in the calculus of the 
ladder operators as they are in the calculus of differential operators. We will derive the equivalent 
of a generating function and use it to rederive the values H n (0) and the orthonormality properties 

of 4> n {y)- 

We start from the generating function (??) and replace according to (4.58) the Hermite polynomials 
H n (y) by the eigenstates <p n (y) 

e 2 ^ 2 = tt! f; ^V/ 2 <My) (4.109) 

Using (4.84) and defining y/2t = u one can write then 

(2 2 \ 00 n 

- V - + V2uy - y = £ ^(^TMy) (4-110) 

or 

(2 2 \ 

-|- + V2uy " yj = e" a+ <Ao(y) • (4.111) 

This expression, in the ladder operator calculus, is the equivalent of the generating function (??). 
We want to derive now the values H n (0). Setting y = in (4.111) yields 

i / u 2 



vr-iexp^-yj = e ua </>o(0) . (4.112) 
Expanding both sides of this equation and using (4.84, 4.58) one obtains 

Comparision of all terms on the l.h.s. and on the r.h.s. provide the same values for H n (0) as provided 
in Eq. (??). 

Similarly, we can reproduce by means of the generating function (4.111) the orthonormality prop- 
erties of the wave functions <p n {y)- For this purpose we consider 

{e uh+ <j) Q \e™ + <M = e™ + 4>o) (4.114) 

where we have employed (4.95). Using (4.98) and again (4.95) one obtains 

(e™ + <t> \e va+ c/> ) = (Me uv e v&+ fo) = e uv (e v& ~ o |0o> = e uv (4.115) 

where the latter step follows after expansion of e va and using a~ (j>o(y) = 0. Expanding r.h.s. and 
l.h.s. of (4.115) and using (4.84) yields 

-j=t=, (<f>M = E -nr ( 4 - 116 ) 



from which follows by comparision of each term on both sides {<pfi\<pu) = $yv, i.e., the expected 
orthonormality property. 
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4.6 Momentum Representation for the Harmonic Oscillator 

The description of the harmonic oscillator provided so far allows one to determine the probability 
density P(x) of finding an oscillator at position x. For example, for an oscillator in a stationary 
state (fin(x) as given by (2.235), the probability density is 

P(x) = \Mx)\ 2 • (4-117) 

In this section we want to provide a representation for the harmonic oscillator which is most natural 
if one wishes to determine for a stationary state of the system the probability density P(p) of finding 
the system at momentum p. For this purpose we employ the Schrodinger equation in the momentum 
representation. 

In the position representation the wave function is a function of x, i.e., is given by a function ip(x); 
the momentum and position operators are as stated in (4.6) and the Hamiltonian (4.7) is given by 
(4.2). In the momentum representation the wave function is a function of p, i.e., is given by the 
function i^ip)'-, the- momentum and position operators are 

p = p , x = ih^- , (4.118) 
dp 



and the Hamiltonian (4.7) is 



H = ^P 2 (4-119) 
2m 1 2 dp 2 y ' 

Accordingly, the time-independent Schrodinger equation for the oscillator can be written 

— ^ + ^P 2 )MP) = E MP ). (4.120) 
Multiplying this equation by 1 / m 2 u) 2 yields 

2^nJp~ 2 + \ m ^P 2 ) ^(P) = EMp) (4.121) 
where 

E = E/m 2 uj 2 , u = l/m 2 uj. (4.122) 

For a solution of the Schrodinger equation in the momentum representation, i.e., of (4.121), we 
note that the posed eigenvalue problem is formally identical to the Schrodinger equation in the 
position representation as given by (4.2, 4.3). The solutions can be stated readily exploiting the 
earlier results. The eigenvalues, according to (4.37), are 

E n = hu(n + ±), n = 0,1,2,... (4.123) 

or, using (4.122), 

E n = hw{n+\)), n = 0,1,2,... (4.124) 

As to be expected, the eigenvalues are identical to those determined in the position representation. 
The momentum representation eigenfunctions are, using (2.235), 



Mp) = ( ' >y 



V2~ r 



muj 
irh 



ex P |-=f (4,25) 
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or with (4.122) 



4>n(p) = 



(-i) n 
V2 n n\ 



1 



irmhuj 



exp 



2mhw) n ^ mfko 



p). 



(4.126) 



Normalized eigenfunctions are, of course, denned only up to an arbitrary phase factor e ia , a£l; 
this allowed us to introduce in (4.125, 4.126) a phase factor (— i) n . 

We can now state the probability density P n {j>) for an oscillator with energy E n to assume momen- 
tum p. According to the theory of the momentum representation holds 



Pn(p) = \<t>n(P)f 



or 



Pnip) = 



2 n nl 



1 



exp 



mhuo 



mhuo 



p). 



(4.127) 



(4.128) 



We may also express the eigenstates (4.126) in terms of dimensionless units following the procedure 
adopted for the position representation where we employed the substitution (4.30) to obtain (4.58). 
Defining k = ^/muj/h p or 



k 



mhuj 



P 



one obtains, instead of (4.126), 



»(*) 



(-*)» 



y/2 n n\ ^Tk 



e~^H n {k) . 



(4.129) 



(4.130) 



We want to finally apply the relationship 

4>n{p) = 



2tt 



dx exp(—ipx/h) 4> n (x) 



in the present case employing dimensionless units. From (4.30) and (4.129) follows 

ky = px I H 

and, hence, 



»(*) 



i r+°° 
I — / 

V2"7T J -oo 



dy exp(-ifcy) <p n (y) 



(4.131) 



(4.132) 



(4.133) 



where we note that a change of the normalization of 4> n (y) [c.f. (4.30) and (2.235)] absorbs the 
replacement dx — > dy, i.e., the Jacobian. This implies the property of Hermite polynomials 



(-i) n e~H n (k) 



+oo 



dy exp(-iky) e 2 H n (y) 



(4.134) 



Exercise 4.6.1: Demonstrate the validity of (4.134), i.e., the correctness of the phase factors 
(— i) n , through direct integration. Proceed as follows. 
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(a) Prove first the property 

f( k ' z ) = \ dyexp(-iky)g(z,y) (4.135) 

\/ ITT J -oo 

where 

f(k,z) = exp(-2ikz + z 2 -k 2 /2) , g(k, z) — exp (2yz — z 2 — y 2 /2) (4.136) 

(b) Employ the generating function (2.194, 2.196) of Hermite polynomials and equate coefficients 
of equal powers of z to prove (4.134). 



4.7 Quasi- Classical States of the Harmonic Oscillator 

A classical harmonic oscillator, for example, a pendulum swinging at small amplitudes, carries out 
a periodic motion described by 

y(t) = Re(yoe^) (4.137) 

where y{t) describes the center of the particle and where the mass of the particle remains narrowly 
distributed around y(t) for an arbitrary period of time. The question arises if similar states exist 
also for the quantum oscillator. The answer is 'y es '- Such states are referred to as quasi- classical 
states, coherent states, or Glauber states of the harmonic oscillator and they play, for example, a 
useful role in Quantum Electrodynamics since they allow to reproduce with a quantized field as 
closely as possible the properties of a classical field 3 . We want to construct and characterize such 
states. 

The quasi- classical states can be obtained by generalizing the construction of the ground state of 
the oscillator, namely through the eigenvalue problem for normalized states 

a -^ a) {y) = a(f> (a) {y), a G R, (0 (a) | <£ (a) ) = JK . (4.138) 

We will show that a harmonic oscillator prepared in such a state will remain forever in such a 
state, except that a changes periodically in time. We will find that a can be characterized as 
a displacement form the minimum of the oscillator and that the state (p^ a \y) displays the same 
spatial probability distribution as the ground state. The motion of the probability distribution of 
such state is presented in Fig. 4.1 showing in its top row the attributes of such state just discussed. 
We first construct the solution of (4.138). For this purpose we assume such state exists and expand 

2 



^\y) = £ d { n a) My) , E K a) = 1 ( 4 - 139 ) 



n=0 n=0 



where we have added the condition that the states be normalized. Inserting this expansion into 
(4.138) yields using a~ <p n {y) = y/n(/> n -i(y) 

oo 

E (v^4 a) ^n-i(y) - ad^My)) = (4.140) 



n=0 



An excellent textbook is Photons and Atoms: Introduction to Quantum Electrodynamics by C.Chen-Tannoudji, 
J.Dupont-Roc, and G.Grynberg (John Wiley & Sons, Inc., New York, 1989) which discusses quasi-classical states of 
the free electromagnetic field in Sect. — III.C.4, pp. 192 
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or 

oo 

J2 (^+l4 Q i - «4 Q) ) Mi/) = • (4.141) 

n=0 

Because of the linear independence of the states <f> n (y) ah coefficients multiplying <p n {y) must vanish 
and on obtains the recursion relationship 

d { :l = (4-142) 
\Jn + 1 



The solution is 



= ^Lc (4.143) 



for a constant c which is determined through the normalization condition [see (4.139)] 

oo „ oo 2 

1 = = | c p ^ ^ = |c| 2 e« 2 . (4.144) 



n=0 
,2 



n! 

?1=0 



Normalization requires c = exp(— a /2) and, hence, the quasi-classical states are 

(2 \ oo n 

Using the generating function in the form (4.109), i.e., 



n=0 



[ir\ * e 2 
one can write 



(2 \ oo ra 

7 71=0 V 

(y) = [7r]-iexp(-^(y - v^a) 2 ) (4.147) 

which identifies this state as a ground state of the oscillator displaced by y/2a. This interpretation 
justifies the choice of a € I in (4.138). However, we will see below that a € C are also admissible 
and will provide a corresponding interpretation. 
One can also write (4.145) using (4.84) 

(2 \ 00 ( " + \ n * 

- Y E Mf) = e- 9 ^ e ah+ MV) • (4-148) 

' n=0 n ' 

The identity (4.108) allows one to write 

^ a \y) = T+(a)My) = T+(a)[vr]-i e ^ 2 /2 (4 . 149) 

where 

T+(a) = e ^ + -«*«- (4.150) 

for a presently real. Comparing (4.149) with (4.147) allows one to interprete T{a) as an operator 
which shifts the ground state by y/2a. 
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The shift operator as defined in (4.150) for complex a obeys the unitary property 

T + (a)T(a) = 1 (4.151) 

which follows from a generalization of (4.95) to linear combinations aa + + fiaT and reads then 

(f{aa + + /3a - ) = (4>\f{a* a - + /3*a+)^) • (4.152) 

Since the operator function on the r.h.s. is the adjoint of the operator function on the l.h.s. one 
can write in case of T + {a) 



(T+(a)) + = T(a) = e a *^ 



■tto + . g-(ao+-a*a ) 



(4.153) 



This is obviously the inverse of T + {a) as defined in (4.150) and one can conclude that T + {a) is a 
so-called unitary operator with the property 



T+(a)T(a) = T(a)T+(a) = 1 
or applying this to a scalar product like (4.152) 

{T + (a)f\T + (a)g) = (/ \T( a )T + (a) g) = (/ \g) . 

In particular, it holds 

(T + (a)0 o |T + (a)0o) = (/ \T{a) T+(a) g) = (fa , a G C 



(4.154) 



(4.155) 



(4.156) 



, i.e., the shift operator leaves the ground state normalized. 

Following the procedure adopted in determining the propagator of the harmonic oscillator [see 
(4.67, 4.68)] one can write the time-dependent solution with (4.145) as the initial state 



^ a \yM) = exp 



a 
~2 



ex P ( -*w(n + -)(ti - t )j —=(j) n {y) 

?1=0 ' _ /V fi. 



(4.157) 



or 



(2 \ u<j / \ n 
" yj E T^-^«( y ) ' u = e-^-' o) • (4.158) 

' 71=0 

Comparision of this expression with (4.145) shows that a in (4.145) is replaced by a complex number 
au which at times t\ = to + 2irn/LU, n = 0, 1, . . . becomes real. Relationship (4.109) serves again 
to simplify this sum 



(y, ti) = 7r 4 ^2 exp 



\ 



a 2 (u 2 + 1) 



(4.159) 



ii 



Noticing the identity which holds for \u\ 



u 2 + 1 = (Reu) 2 - (Iran) 2 + 1 + 2iReulmu = 2(Reu) 2 + 2iReulmu (4.160) 
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the exponent E can be written 

,2 



E = — ^- + V2ayReu — a 2 (Reu) 2 + i V2aylmu — ia 2 Reu!mu 

= — - (^y — \/2aReu^ + i V2j/almu — i a 2 Reu Imu (4.161) 



displaced Gaussian 



momentum phase term 



and is found to describe a displaced Gaussian, a momentum factor and a phase factor. It is of 
interest to note that the displacement yo of the Gaussian as well as the momentum ko and phase 
4>o associated with (4.161) are time-dependent 

yo{t\) = V2 acosuj(ti - t ) 
ko(ti) = — \/2 Q!sinti;(ti — to) 

Mh) = ~ ^a 2 sin2i^i - t ) (4.162) 

with a periodic change. The oscillations of the mean position yo(h) and of the mean momentum 
ko(t\) are out of phase by | just as in the case of the classical oscillator. Our interpretation 
explains also the meaning of a complex a in (4.138): a real a corresponds to a displacement such 
that initially the oscillator is at rest, a complex a corresponds to a displaced oscillator which has 
initially a non-vanishing velocity; obviously, this characterization corresponds closely to that of the 
possible initial states of a classical oscillator. The time-dependent wave function (4.159) is then 

<j) {a) {yM) = vr-3exp^-^[y-yo^i)] 2 + W Hh) ~ i<fo(ti) " - *o)) . (4.163) 

This solution corresponds to the initial state given by (4.138, 4.147). 

In Fig. 4.1 (top row) we present the probability distribution of the Glauber state for various 
instances in time. The diagram illustrates that the wave function retains its Gaussian shape with 
constant width for all times, moving solely its center of mass in an oscillator fashion around the 
minimum of the harmonic potential. This shows clearly that the Glauber state is a close analogue 
to the classical oscillator, except that it is not pointlike. 

One can express (4.163) also through the shift operator. Comparing (4.148, 4.150) with (4.158) 
allows one to write 

^ a \y,h) = u2T + (au)My) (4-164) 

where according to (4.150) 

T+(au) = exp(a(e-^' 1 -*°)a + - e*^* 1 "* ^- ) ) . (4.165) 
This provides a very compact description of the quasi-classical state. 



Arbitrary Gaussian Wave Packet Moving in a Harmonic Potential 

We want to demonstrate now that any initial state described by a Gaussian shows a time dependence 
very similar to that of the Glauber states in that such state remains Gaussian, being displaced 
around the center with period 2tt/uj and, in general, experiences a change of its width (relative to 
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Figure 4.1: Time dependence of Gaussian wave packets in harmonic oscillator potential. 



that of the Glauber states) with a period tt/uj. This behaviour is illustrated in Fig. 4.1 (second and 
third row) . One can recognize the cyclic displacement of the wave packet and the cyclic change of 
its width with a period of twice that of the oscillator. 
To describe such state which satisfies the initial condition 

<Ka ,*\y,to) = [7ra 2 ]-iexp(- ^ ) (4.166) 

we expand 

oo 

4>(a o ,a\y,t ) = 'Y^d n (j) n {y) . (4.167) 

n=0 

One can derive for the expansion coefficients 



where 



f+OO 




/ dy 4> n (y) <f>( 


J — oo 




1 


r+oc 


\j2 n n\-KO 


J C 


e -a a q/2 


r+oo 


\Jl n n\'Ko 


J ° 

J — oo 




1 + tT 


P = 


2a 2 



(y - Qq) 2 _ y 

2a 2 2 



H n (y) 



1 + a 2 



(4.168) 



(4.169) 
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The solution of integral (4.168) is 4 



e -g q /2 {p _ ir /2 



p 



p-i 



)■ 



(4.170) 



In order to check the resulting coefficients we consider the limit a — ► 1 . In this limit the coefficients 
d n should become identical to the coefficients (4.143) of the Glauber state, i.e., a ground state shifted 
by a Q = y/2a. For this purpose we note that according to the explicit expresssion (??) for H n (y) 
the leading power of the Hermite polynomial is 



H n (y) = (2y) n 



0{y n ~ 2 ) for n > 2 
1 for n = 0, 1 



and, therefore, 



lim 

o— >i 



P- 1\ 2 
V 



H n {q 



One can then conclude 



— l — ) = lim(2g) n = a n . 

p-1 a^V 



-a 2 /4 



lim d n = — 



(4.171) 

(4.172) 
(4.173) 



which is, in fact, in agreement with the expected result (4.143). 

Following the strategy adopted previously one can express the time-dependent state corresponding 
to the initial condition (4.166) in analogy to (4.157) as follows 



4>{a ,cr\y,ti) = 



(4.174) 



-aq/2 



This expansion can be written 
<f)(a ,a\y,ti) ■- 



exp 



1 ° 2 + I ?/ 2 

2 1+ct 2 ^ 2»o 2 



- t ) ) 



E^o^r^n(yo)e-*2/ 2 ^„(y)e- 



where 



P 



P-1 vr 



1 - a 2 
1 + a 2 



-ioj(ti— 1 ) 



The generating function (4.76) permits us to write this 



<f)(a ,a\y,ti) 



(4.175) 
(4.176) 

(4.177) 



x exp 



2\ 1+t 2 



1 a 2 
2 



4 This integral can be found in Integrals and Series, vol. 2 by A. P. Prudnikov, Yu. Brychkov, and O.I. Marichev 
(Wiley, New York, 1990); this 3 volume integral table is likely the most complete today, a worthy successor of the 
famous Gradshteyn, unfortunately very expensive, namely $750 
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Chapter 5 

Theory of Angular Momentum and 
Spin 

Rotational symmetry transformations, the group S0(3) of the associated rotation matrices and the 
corresponding transformation matrices of spin-^ states forming the group SU(2) occupy a very 
important position in physics. The reason is that these transformations and groups are closely tied 
to the properties of elementary particles, the building blocks of matter, but also to the properties of 
composite systems. Examples of the latter with particularly simple transformation properties are 
closed shell atoms, e.g., helium, neon, argon, the magic number nuclei like carbon, or the proton 
and the neutron made up of three quarks, all composite systems which appear spherical as far as 
their charge distribution is concerned. In this section we want to investigate how elementary and 
composite systems are described. 

To develop a systematic description of rotational properties of composite quantum systems the 
consideration of rotational transformations is the best starting point. As an illustration we will 
consider first rotational transformations acting on vectors r in 3-dimensional space, i.e., f 6 Kj^, 
we will then consider transformations of wavefunctions ip(r) of single particles in Rj^, and finally 
transformations of products of wavefunctions like YljLi V'j(^) which represent a system of N (spin- 
zero) particles in M^. 

We will also review below the well-known fact that spin states under rotations behave essentially 
identical to angular momentum states, i.e., we will find that the algebraic properties of operators 
governing spatial and spin rotation are identical and that the results derived for products of angular 
momentum states can be applied to products of spin states or a combination of angular momentum 
and spin states. 



5.1 Matrix Representation of the group SO (3) 

In the following we provide a brief introduction to the group of three-dimensional rotation matrices. 
We will also introduce the generators of this group and their algebra as well as the representation 
of rotations through exponential operators. The mathematical techniques presented in this section 
will be used throughout the remainder of these notes. 
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Properties of Rotations in 

Rotational transformations of vectors r 6 M.^, in Cartesian coordinates f = (x±,X2, x^) T , are linear 
and, therefore, can be represented by 3 x 3 matrices R($), where # denotes the rotation, namely 
around an axis given by the direction of $ and by an angle We assume the convention that 
rotations are right-handed, i.e., if the thumb in your right hand fist points in the direction of 
then the fingers of your fist point in the direction of the rotation. # parametrizes the rotations 
uniquely as long as < 2ir. 
Let us define the rotated vector as 

r' = R0)r . (5.1) 

In Cartesian coordinates this reads 

3 

4 = J2l R (#)]kjXj k = 1,2,3 . (5.2) 

3=1 

— * 

Rotations conserve the scalar product between any pair of vectors a and b, i.e., they conserve 
a-b = Y^=i a jbj- It holds then 

3 3 

j,k,e=i j=i 
Since this holds for any a and 6, it follows 

3 

J2[R(3)} jk [R(#)} je = 5 M . (5.4) 

3=1 

With the definition of the transposed matrix R T 

[R T ]jk = R kj (5.5) 

this property can be written 

R{$)R T {$) = R T (S)R(S) = 1 . (5.6) 

This equation states the key characteristic of rotation matrices. From (5.6) follows immediatety 
for the determinant of R($) using det A B = det^4 det-B and detvl^ = det A 

detR(S) = ±1 . (5.7) 

Let us consider briefly an example to illustrate rotational transformations and to interpret the sign 
of deti?($). A rotation around the X3-axis by an angle ip is described by the matrix 

(cos</9 — sini^ \ 
sinv? coscp (5.8) 
1/ 

In case of a prefactor +1, the matrix represents a proper rotation, in case of a prefactor —la 
rotation and an inversion at the origin. In the latter case the determinant of R("d = (0, 0, (f) T ) is 
negative, i.e. a minus sign in Eq.(5.7) implies that a rotation is associated with an inversion. In 
the following we want to exclude inversions and consider only rotation matrices with deti? = 1. 
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Definition of the Group SO(3) 

We will consider then the following set 

SO(3) = {real3 x 3 matrices^; RR T = R T R = 1, detR = 1} (5.9) 

This set of matrices is closed under matrix multiplication and, in fact, forms a group Q satisfying 
the axioms 

(i) For any pair a, b G Q the product c = a o b is also in Q. 

(ii) There exists an element e in Q with the property Va 6 5 -> e o a = a o e = a. 

(iii) Va, aG^-* 3a" 1 £ (5 with a o a -1 = a -1 o a = e. 

(iv) For products of three elements holds the associative law 
a o (6 o c) = (a o 6) o c. 

We want to prove now that SO(3) as defined in (5.9) forms a group. For this purpose we must 
demonstrate that the group properties (i-iv) hold. In the following we will assume R\, R2 € SO (3) 
and do not write out "o" explicitly since it represents the matrix product. 

(i) Obviously, R3 = R1R2 is a real, 3 x 3-matrix. It holds 

R% -R3 = ( Ri R2 )~^ -Ri R2 = R2 R\ Ri -R2 = R2 ^2 = 1 • (5.10) 
Similarly, one can show R3R3 = 1. Furthermore, it holds 

deti? 3 = det (R1R2) = detR 1 detR 2 = 1. (5.11) 
It follows that i?3 is also an element of SO (3). 

(ii) The group element 'e' is played by the 3x3 identity matrix E. 

(iii) From deti?i 7^ follows that Ri is non-singular and, hence, there exists a real 3 x 3-matrix 
R^ 1 which is the inverse of R\. We need to demonstrate that this inverse belongs also to 
SO(3). Since (i^T = (^f) _1 it follows 

{R^TRi 1 = {Rlr'Ri 1 = (RiRly 1 = l- 1 = 11 (5.12) 
which implies R^ 1 € SO (3). 

(iv) Since the associative law holds for multiplication of any square matrices this property holds 
for the elements of SO (3). 

We have shown altogether that the elements of SO(3) form a group. 

We would like to point out the obvious property that for all elements R of SO (3) holds 

R- 1 = R T (5.13) 



100 



Theory of Angular Momentum and Spin 



Exercise 5.1.1: Test if the following sets together with the stated binary operation 'o' form groups; 
identify subgroups; establish if 1-1 homomorphic mappings exist, i.e. mappings between the sets 
which conserve the group properties: 

(a) the set of real and imaginary numbers — 1, — i} together with multiplication as the binary 
operation 'o'; 

(b) the set of matrices {M 1 , M 2 , M 3 , M 4 } 

={(; ;)•(-! "o o 1 )} 

together with matrix multiplication as the binary operation 'o'; 

(c) the set of four rotations in the x\ , X2-plane {rotation by 0°, rotation by 90°, rotation by 180°, 
rotation by 270°} and consecutive execution of rotation as the binary operation 'o'. 



Generators, Lie Group, Lie Algebra 

We want to consider now a most convenient, compact representation of R($) which will be of 
general use and, in fact, will play a central role in the representation of many other symmetry 
transformations in Physics. This representation expresses rotation matrices in terms of three 3x3- 
matrices J\, J2, J3 as follows 

R($) = exp^-^-J^ (5.15) 

Below we will construct appropriate matrices presently, we want to assume that they exist. 
We want to show that for the property R R T = 1 to hold, the matrices 

L k = ~^Jk (5.16) 

must be antisymmetric. To demonstrate this property consider rotations with •& = fi^k where e& 
is a unit vector in the direction of the Cartesian A;-axis. Geometric intuition tells us 

R^kek)- 1 = R{-°S k e k ) = exp(-^L fc ) (5.17) 

Using the property which holds for matrix functions 

[f(R)f = f(R T ) (5.18) 

we can employ (5.13) 

R^kh)' 1 = R^khf = [exp(tf fe L fe )] T = exp(^L^) (5.19) 
and, due to the uniqueness of the inverse, we arrive at the property 

exp(0 k LZ) = exp(-^L fc ) (5.20) 

from which follows = — Lfc- 
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We can conclude then that if elements of SO(3) can be expressed as R = e A , the real matrix A is 
anti-symmetric. This property gives A then only three independent matrix elements, i.e., A must 
have the form 

a b 

A - I -a c ) . (5.21) 
-b -c 



It is this the reason why one can expect that three-dimensional rotation matrices can be expressed 
through (5.15) with three real parameters #i,#2, $3 and three matrices Li,L2,L 3 which are inde- 
pendent of the rotation angles. 

We assume now that we parameterize rotations through a three-dimensional rotation vector $ = 
(tfi,tf 2 ,#3) T such that for $ = (0,0, 0) T the rotat ion is the identity. One can determine then the 
matrices defined in (5.15) as the derivatives of i?(#) with respect to the Cartesian components 
"&k taken at "d = (0, 0, 0) T . Using the definition of partial derivatives we can state 



Li = Jirn^r 1 (f0 = (-^1,0, 0) T ) - l) 



(5.22) 



and similar for L2 and L3. 

Let us use this definition to evaluate L3. Using (5.8) we can state 



lim R[(O,O,0 3 ) 
#3-^0 



cos$3 — sini93 
lim I sim?3 cos$3 












(5.23) 



Insertion into (5.22) for k = 3 yields 






-03 








-1 


























/ 




V 





/ 



(5.24) 



One obtains in this way 



Ji 



Li 



L 2 



J?> = L3 




(5.25) 



(5.26) 



(5.27) 



The matrices are called the generators of the group SO (3). 



Exercise 5.1.2: Determine the generator L\ according to Eq.(5.22). 
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Sofar it is by no means obvious that the generators Lk allow one to represent the rotation matrices 
for any finite rotation, i.e., that the operators (5.15) obey the group property. The latter property 
implies that for any i?i and #2 there exists a $3 such that 

exp ^ ■ J^j exp ^ ■ fj = exp ^ - % -^ ■ J^j (5.28) 

The construction of the generators through the limit taken in Eq. (5.22) implies, however, that 
the generators represent infinitesimal rotations with 1 >> around the x±-, X2-, and X3-axes. 
Finite rotations can be obtained by applications of many infinitesimal rotations of the type R($ = 
(ei,0,0) T ) = exp(eiLi), R(S = (0,e 2 ,0) T ) = exp(e 2 L 2 ) and R0 = (0, 0, e 3 ) T ) = exp(e 3 L 3 ). The 
question is then, however, if the resulting products of exponential operators can be expressed in 
the form (5.15). An answer can be found considering the Baker-Campbell-Hausdorf expansion 

00 

exp(\A)exp(\B) = exp(J2 \ n Z n ) (5.29) 

n=l 

where Z\ = A + B and where the remaining operators Z n are commutators of A and B, com- 
mutators of commutators of A and B, etc. The expression for Z n are actually rather complicated, 
e.g. Z 2 = \[A, B], Z 3 = ( [A [A B \\ + [[A B], B] ). From this result one can conclude that 
expressions of the type (5.15) will be closed under matrix multiplication only in the case that 
commutators of L^ can be expressed in terms of linear combinations of generators, i.e., it must 
hold 

3 

[Lk, Li] = fklmLm ■ (5.30) 

m=l 

Groups with the property that their elements can be expressed like (5.15) and their generators 
obey the property (5.30) are called Lie groups, the property (5.30) is called the Lie algebra, and 
the constants jkim are called the structure constants. Of course, Lie groups can have any number 
of generators, three being a special case. 

In case of the group SO(3) the structure constants are particularly simple. In fact, the Lie algebra 
of SO(3) can be written 

[Lk, Li] = ekimLm (5.31) 

where ektm are the elements of the totally antisymmetric 3-dimensional tensor, the elements of 
which are 

if any two indices are identical 

1 for k = l,£ = 2,m = 3 



Zkim — \ 



(5.32) 

1 for any even permutation of k = 1, £ = 2, m = 3 
—1 for any odd permutation oi k = 1,£ = 2,m = 3 



For the matrices = ihL^ which, as we see later, are related to angular momentum operators, 
holds the algebra 

[Jk, Je] = ifrtkemJm ■ (5.33) 

We want to demonstrate now that exp($ • L) yields the known rotational transformations, e.g., the 
matrix (5.8) in case of d = (0,0, <f>) T ■ We want to consider, in fact, only the latter example and 
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determine exp^L^). We note, that the matrix according to (5.27), can be written 



ipL 3 = 



A 



, A = ip 



\ / 
One obtains then for the rotational transformation 



-1 

1 



(5.34) 



exp 



A 






000 




(5.35) 



To determine expA = Yl^Li A"/i/\ we split its Taylor expansion into even and odd powers 

y -±-A 2n + y , 1 - , A 2n+1 

The property 



n=0 



n=0 



allows one to write 

-1 

1 

and, accordingly, 



2n 



= (-iy 



-1 

1 



1 
1 



1 
1 



-1 

1 



2n+l 



= ("l) r 



-1 

1 



- 4 = E 



n=0 



(2n)! 



2n(l 0\ (-1)" ~1\ 

[O 1 J + A.(2n + 1)!^ il j 



Recognizing the Taylor expansions of cos if and sin ip one obtains 



e A = 



and, using (5.34, 5.35), 



cos if — sm 99 
sin ip cos 



cos if — sin </? 
sin f cos 93 
1 



(5.36) 



(5.37) 



(5.38) 



(5.39) 



(5.40) 



(5.41) 



which agrees with (5.8). 



Exercise 5.1.3: Show that the generators of SO(3) obey the commutation relationship (5.31). 
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5.2 Function space representation of the group SO(3) 

In this section we will investigate how rotational transformations act on single particle wavefunctions 
tp(f). In particular, we will learn that transformation patterns and invariances are connected with 
the angular momentum states of quantum mechanics. 

Definition 

We first define how a rotational transformation acts on a wavefunction ip{r). For this purpose we 
require stringent continuity properties of the wavefunction: the wavefunctions under consideration 
must be elements of the set Coo(J£), i.e., complex- valued functions over the 3-dimensional space 
which can be differentiated infinitely often. In analogy to being a linear map — > M^, we 
define rotations as linear maps Coo(^) — > CqoQ^), namely 

K(#)ip(r) = ^(R" 1 ^) r) . (5.42) 

Obviously, the transformation 7£(#): Coo(^) — ► Coo(^) is related to the transformation R(i9): 
R\p — > Mj+f, a relationship which we like to express as follows 

110) = p (#(#)) ■ (5.43) 

p( ) conserves the group property of SO(3), i.e. for A, B G SO(3) holds 

p(AB) = p{A)p{B) . (5.44) 

This important property which makes p( SO (3) ) also into a group can be proven by considering 

p (r($i) R02)) ^(r) = ^{[R^i) Ri^)}- 1 ^ (5.45) 
= V([^2)] _1 [iW]-V) = p(R0 2 j) iPdR^i)]- 1 ^ (5.46) 
= p(i?(#i)) p(^2)) Y^l • (5.47) 

Since this holds for any ip(r) one can conclude p ^i?(i?i)^ p ^#(#2)^ = p (^R($i) R^i^j, i.e. the 
group property (5.44) holds. 

Exercise 5.2.1: Assume the definition p' ^i?(i?)^ i/)(r) = ip(R0) f). Show that in this case holds 
p'(AB) = p'(B)p'(A). 



Generators 

One can assume then that 7£(i9) should also form a Lie group, in fact, one isomorphic to SO(3) 
and, hence, its elements can be represented through an exponential 

K(#) = exp(~0-f) (5.48) 
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where the generators can be determined in analogy to Eq. (5.22) according to 



Ji = Jim^r 1 (K0 = (-#1,0, 0) T ) - l) (5.49) 



(note the minus sign of — i?i which originates from the inverse of the rotation in Eq.(5.42)) and 
similarly for J 2 and J 3 . 

The generators Jk can be determined by applying (5.49) to a function /(f), i.e., in case of J 3 , 

= j^/z 1 (K0= (0,0,tf 3 ) T )/(f) - /(f)) 

= lim tfj 1 ( f(R(0,0,-& 3 )r) - f(f) ) . (5.50) 

Using (5.23, 5.24) one can expand 

/ 1 tf 3 \ / Xi \ / X X + $ 3 X 2 \ 

i?(0,0,-tf 3 )f « -<? 3 1 x 2 = -^i + x 2 . (5.51) 

V 1 / V ^ ) \ x 3 ) 

Inserting this into (5.50) and Taylor expansion yields 

—Jzfif) = lim tfj 1 (/(f) + # 3 x 2 d 1 f(r) - # 3Xl d 2 f(r) - /(f) ) 
fl V3— >o 

= (x 2 di - Xl d 2 ) /(f) • (5.52) 
Carrying out similar calculations for J\ and J2 one can derive 

(5.53) 
(5.54) 
(5.55) 

Not only is the group property of SO(3) conserved by p( ), but also the Lie algebra of the generators. 
In fact, it holds 

[Jk, Ji] = ifttkimJm ■ (5.56) 
We like to verify this property for [Ji, J2]. One obtains using (5.53-5.55) and f(f) = f 

[Ji,J2\m = -ft 2 [-i7i,-i7 2 ]/(r) =-h 2 x 
[ (x 3 d 2 - x 2 d 3 ) (xid 3 f - x 3 dif) - (x\d 3 - x 3 di) (x 3 d 2 f - x 2 d 3 f) } 
= -ft 2 [ +xix 3 d 2 d 3 f - xix 2 d$f - x 2 3 did 2 f + x 2 x 3 did 3 f + x 2 dif 
-xid 2 f - xix 3 d 2 d 3 f + x\did 2 f + xix 2 d%f - x 2 x 3 did 3 f } 

= -ft 2 ( x 2 di - Xl d 2 ) /(f) = iftj 3 /(f) . (5.57) 





= x 3 d 2 


- x 2 d 3 


~tJ2 
ft 


= xxd 3 


- x 3 dx 


ft 


= x 2 dx 


- xid 2 



Exercise 5.2.2: (a) Derive the generators Jk, k = 1, 2 by means of limits as suggested in Eq. (5.49); 
show that (5.56) holds for [J 2 , J 3 ] and [J 3 , J x \. 
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Exercise 5.2.3: Consider a wave function tp((p) which depends only on the azimutal angle <fi of the 
spherical coordinate system (r, 8, (f>), i.e. the system related to the Cartesian coordinates through 
x\ = rsin#cos0, x 2 = rsin6>sin0, and xs = rcos9. Show that for J% as defined above holds 
exp(f «7 3 )W) = iP(<f> + a). 



5.3 Angular Momentum Operators 

The generators Jk can be readily recognized as the angular momentum operators of quantum 
mechanics. To show this we first note that (5.53-5.55) can be written as a vector product of f and 
V 

- l -J = -f x V . (5.58) 
From this follows, using the momentum operator p = |V, 

J = f x = r x p (5.59) 

which, according to the correspondence principle between classical and quantum mechanics, iden- 
tifies J as the quantum mechanical angular momentum operator. 

Equation (5.56) are the famous commutation relationships of the quantium mechanical angular 
momentum operators. The commutation relationships imply that no simultaneous eigenstates of 
all three Jk exist. However, the operator 

J 2 = Ji +Jf +Ji (5.60) 

commutes with all three Jk-, k = 1,2, 3, i.e., 

[J 2 ,J k ] =0, A: = 1,2,3. (5.61) 

We demonstrate this property for k = 3. Using [AS, C] = A[B, C] + [A,C]B and (5.56) one 
obtains 

[J 2 ,Js] = [J? + JZ,J 3 } (5.62) 

= -ihJ l J 2 - ihJ 2 Ji + ihJ 2 Ji + ihJ 1 J 2 = o 

According to (5.61) simultaneous eigenstates of J 2 and of one of the Jk, usually chosen as can 
be found. These eigenstates are the well-known spherical harmonics Yn m {6,4>). We will show now 
that the properties of spherical harmonics 

J 2 Y, m {Q^) = h 2 t(t + \)Y lm (Q^) (5.63) 
J3Y tm (9,<j>) = hmY em (6,0) (5.64) 

as well as the effect of the so-called raising and lowering operators 



J± = Ji ± ij 2 



(5.65) 
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on the spherical harmonics, namely, 

J + Y tm (0,<i>) = h^J{£ + m + l){£- m) Y lm+l {6, 0) (5.66) 

J + Y U {9,<P) = (5.67) 

J-Y im+l {e,cj>) = hy/{l + m + l)(l-m)Y tm {0,<j>) (5.68) 

J-Y t - t {0,<l>) = (5.69) 

are a consequence of the Lie algebra (5.56). For this purpose we prove the following theorem. 
An Important Theorem 

Theorem 1.1 Let k = 1,2,3 be operators acting on a Hilbert space H which obey the algebra 
[Lk,Li] = ekim L m and let L± = L\ ± iL 2 , L? = L\ + L\ + L 3 . Let there exist states \££) in 
H with the property 

L + \££) = (5.70) 

L 3 \££) = -i£\££) (5.71) 

then the states defined through 

L_ \£m+ 1) = -i(3 m \£m) (5.72) 



have the following properties 



(i) L + \£m) = -ia m \£m + l) (5.73) 
(ii) L 3 \£m) = -im\£m) (5.74) 
(iii) L 2 \£m) = -£(£ + l)\£m) (5.75) 



To prove this theorem we first show by induction that (i) holds. For this purpose we first demon- 
strate that the property holds for m = £ — 1 by considering (i) for the state L_\££) ~ \££ — 1). It 
holds L + L_\££) = [L+L^ - L_L+ + L_L + )\£ £) = ([L+,L_] + L_L + )\£ £). Noting [L+,L_] = 
[Li + zL 2 , L\ - iL 2 ] = -2iL 3 and L + \£ £) = one obtains L + L_\££) = -2iL 3 \££) = -2£\££), 
i.e., in fact, L+ raises the m-value of \££ — 1) from m = £ — 1 to m = £. The definitions of the 
coefficients a£_i and fle-i yield L + L_\££) = L + (i(3e-\\££ — 1) = — an-\fii-\ \££), i.e. 

ae-iPe-i = 2£ . (*) (5.76) 

We now show that property (iz) also holds for m = £ — 1 proceeding in a similar way. We note 
L 3 L_\££) = ([L 3 ,L_] +L_L 3 )|^). Using [L 3 ,L_] = [Lg^i-iLa] = L 2 + iLi = i (Li - 
iL 2 ) = iL^ and L 3 \U) = -%l\££) we obtain L 3 L_\££) = \iL_ + L_L 3 )\££) = i(-£ + 1) L_\££). 
From this follows that L-\££) is an eigenstate of L 3 , and since we have already shown \££ — 1) = 
(i/3 £ _i)- 1 L_|^) we can state L 3 |« - 1) = -i(£ - 1)\££ - 1). 

Continuing our proof through induction we assume now that property (i) holds for m. We will 
show that this property holds then also for m — 1. The arguments are very similar to the ones 
used above and we can be brief. We consider L + L^\£m) = ( [L + , L_] + L_L + )|^m) = ( —2iL 3 + 
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L-L + )\£m) = (—2m — a m j3 m )\£m) . This implies that (i) holds for m— 1, in particular, from 
L + L-\lm) = — a m ^\(5 m -\\lm) follows the recursion relationship 

a m -i(3 m -i = 2m + a m (3 m . (**) (5-77) 

We will show now that if (ii) holds for m, it also holds for m — 1. Again the argumemts are similar 
to the ones used above. We note L^L-\£m) = ( [L3,L_] + L-L^)\lm) = (iL_ — imL_ )|£m) = 
— i (m — 1) L_|£m) from which we can deduce L^\£m — 1) = — i(m — l)\£m — 1). 
Before we can finally show that property (Hi) holds as well for all — £ < m < £ we need to determine 
the coefficients a m and [3 m . We can deduce from (*) and (**) 

a/-i/3/-i = M 

a e - 2 Pe-2 = 2 (£-!+£) 
a<_ 3 /3/-3 = 2 - 2 + * - 1 + £) 



Obviously, it holds a m /3 m = 2 (Yll=m+i Using the familiar formula ^^ =0 k = n(n+ l)/2 
one obtains 

a m /3 m = (£+1)^ - (m + l)m = + 1) £ + m^ - £m - (m + l)m 

= + m + 1)(£ - m) (5.78) 

One can normalize the states \£m) such that a m = j3 m , i.e., finally 

a m = (3 m = v / (£ + m + l){£ - m) . (5.79) 

We can now show that (iii) holds for all proper m-values. We note that we can write L 2 = 
\L+L- + \L-L + + L\. It follows then l?\£m) = - (\a m -i(3 m ~i + \a m f3 m + m 2 ) \£m) = 
- (l(£ + m + 1)(£ - m) 

+ \ (£ + m)(£ — m + 1) + m 2 ) = — £{£ + l)|£m). This completes the proof of the theorem. 
We note that Eqs. (5.70, 5.72, 5.73, 5.79) read 

L + \£m) = -i^{£ + m + l)(£-m) \£m + 1) (5.80) 
L_|^m + 1) = -i^{£ + m + l){£-m)\£m) , (5.81) 

i.e., yield properties (5.66, 5.69). 

We will have various opportunities to employ the Theorem just derived. A first application will 
involve the construction of the eigenstates of the angular momentum operators as defined in (5.63, 
5.64). For this purpose we need to express the angular momentum operators in terms of spherical 
coordinates (r, 9, (p. 

Angular Momentum Operators in Spherical Coordinates 

We want to express now the generators given in (5.53-5.55), in terms of spherical coordinates 
defined through 

x\ = rsin#cos(/> (5.82) 
%2 = rsin#sin</> (5.83) 
X3 = rcos9 (5.84) 
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We first like to demonstrate that the generators Jk, actually, involve only the angular variables 9 
and (j) and not the radius r. In fact, we will prove 

i d d 

— i:Ji = £i = sin0— + cot6>cosc/>— (5.85) 
n 69 ocp 

% d d 

— -zfo = £2 = - coscp— + cot 9 sin <j>— (5.86) 
n o9 ocp 

-{A = £. =-£ . (5.87) 

To derive these properties we consider, for example, C\f(0,4>) = (x3<92 — x 2 d 3 )f(9, (j>). Using 
(5.82-5.84 ) one obtains, applying repeatedly the chain rule, 

df(9,4>) = (x 3 d 2 - x 2 d 3 )f(9,<f>) (5.88) 

dx 2 /xi d dx 3 /r d 

X3 ^ 7 + ~ 7. 5 ' 89 

0x2 o tan cp 0x2 o cos 9 



9X 2 /Xl Q _ ^_ Q X3 / r Q 

X3 
= 



x 3 d x\x 2 d x\ + x\ d , 

~ x 2 — -5 77—^ /(M) (5-90) 



xi <9tan0 r 3 dcos9 r 3 dcos9 

This yields, using d/dt&ncf) = cos 2 (f) d/d(j) and d/dcos9 = —(1/smO) d/d6, 

\ sm cos a<p 
cos 2 sin sin </> <9 sin 9 sin & . 2 „ d 



sin 6* at* suit? 39 J 

which agrees with expression (5.85). 

We like to express now the operators J 3 and J 2 , the latter defined in (5.60), in terms of spherical 
coordinates. According to (5.87) holds 

Jz = - d<p . (5.92) 



To determine J 2 we note, using (5.85), 

(£i) 2 = (sin^^- + cot # cos 



- + cotflcos,/,- 

d 2 1 d d 2 

sm 2 6——r — — 7T- sm<b cosd> — — + 2 cot9 smcb cose 



dfl 2 s in 2 r r d0 ' —v^^ 



5 ,„ o , d 2 

+ cotfl COS"0 

and similarly, using (5.86), 



+ cot6»cos 2 0— + cot 2 6»cos 2 0^ (5.93) 
39 oep z 



d „ . , a x - 



(^2) 2 = ( -cos0^ + cote sin ^ 
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d 2 



1 d d 2 

= cos"(p—-^ + -^-^-sm^cos^— - 2cot6»sm^cos0^-— - 
dd 2 sm £ o<p ovocp 







+ cot# sin — + cot 9 sin 6 
oO 



dcj) 2 



It follows 



(A) 2 + (C 2 ) 2 + (£ 3 ) 2 = ^ + cot^ + (cot 2 # + 1)^ 



With cot 2 + 1 = l/sin 2 fl, 



sin 



suit 



d_ 
80 



smfc 



d_ 

de 



d 2 n d 
W 2 +cot ^' 



and the relationship between Ck and Jk as given in (5.85, 5.86, 5.87), one can conclude 

- [ sm/9 — ] + - 
sin'" 



J 1 



l sm0 de) + w 



(5.94) 



(5.95) 



(5.96) 



(5.97) 



Kinetic Energy Operator The kinetic energy of a classical particle can be expressed 

2m 2m ' 2mr 2 



p 2 



+ 



T 2 

class 



(5.98) 



where J c i a ss = r x mv is the angular momentum and p r = mr is the radial momentum. The 
corresponding expression for the quantum mechanical kinetic energy operator is 



f = 


h 2 

2m 


V 2 




which follows from the identity 








V 2 = 


d 2 


2 


d 


dr 2 


r 


dr 


i d 2 

r dr 2 


d 2 

dr 2 


2 

r 


d 
dr 



h 2 i d 2 



J 2 



and comparision with (5.97). 



r + 

2m r dr 2 2m r 2 



r 2 sin 2 £ 



sin# 



d_ 
dB 



+ 



d(f> 2 



(5.99) 



(5.100) 
(5.101) 



5.4 Angular Momentum Eigenstates 

According to the theorem on page 107 above the eigenfunctions (5.63, 5.64) can be constructed as 
follows. One first identifies the Hilbert space H on which the generators Ck operate. The operators 
in the present case are Ck = ~\Jk where the Jk are differential operators in 6 and (f> given in (5.85, 
5.86, 5.87). These generators act on a subspace of Coo(^), namely on the space of complex-valued 
functions on the unit 3-dim. sphere S%, i.e. Coo(S^). The functions in Coo(S>i) have real variables 



5.4: Angular Momentum Eigenstates 



111 



0, 4>, < 9 < ir, < (f> < 2ir, and to be admissable for description of quantum states, must be 
cyclic in (j) with period 27r. The norm in EI is defined through 

|/| 2 = / dfi/*(M)/(M) (5.102) 

where d£l = s'm9 d9 d(j) is the volume element of The function space endowed with this norm 
and including only functions for which the integral (5.102) exists, is indeed a Hilbert space. 
The eigenfunctions (5.63, 5.64) can be constructed then by seeking first functions Yu{9,4>) in H 
which satisfy jC + Y^(9, 4>) = as well as £3^(^,0) = —i£Yu(9,(f)), normalizing these functions, 
and then determining the family {Yg m (9, (f>),m = —£, — £ + 1, . . . , £} applying 

C-Y hn+1 {9,<p) = -iy/(£ + m + !){£- m)Y £m {9, <j>) (5.103) 

iteratively for m = £ — 1, £ — 2, . . . , —£. One obtains in this way 



Y tm {9,<t>) = A(e,m) [-J-] Y U {9A) (5-104) 



A(£, m) 



Constructing Y u ->■ Y U -i -»■ -»■ ■■■Yi- 



(£ + m)! 
_(2£)l(£ - m)\ 



(5.105) 



We like to characterize the resulting eigenfunctions Yg m {9, <f>), the so-called spherical harmonics, by 
carrying out the construction according to (5.104, 5.105) explicitly. For this purpose we split off 
a suitable normalization factor as well as introduce the assumption that the dependence on (f> is 
described by a factor exp(im0) 



YU0A) = J 2 4^ 1 (£ + ^)| P Ucos9) . (5.106) 



It must hold 



C + Y u (9,<f>) = (5.107) 

C 3 Y u (9,(f>) = -i£Y u {9,4>). (5.108) 

The latter property is obviously satisfied for the chosen (^-dependence. The raising and lowering 
operators 

C± = d ± iC 2 , (5.109) 

using (5.85-5.87), can be expressed 

^ = ^(m ±iaM h) ■ (5 - 110) 



Accordingly, (5.107) reads 



— + icot^— ) P u {cos9)c ! " - (5.111) 
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or 



d_ 
89 



t cot0 P u {cos9) = 



(5.112) 



where we employed the definition of -P^(cos#) in (5.107). A proper solution of this equation, i.e. 
one for which 

|2 



PTT 

/ d0sin0|P«(cos0)| ; 
J o 

is finite, is 

P tt (cos6») = Af £ sin £ 
as one can readily verify. The normalization factor Ng is chosen such that 



/>7T 

Jo 



d9 sin9 \Y ee {cos9)f 



1 



holds. (5.106, 5.114) yield 



\N ( 



' 4^ (2£)! 7o 



(5.113) 
(5.114) 

(5.115) 
(5.116) 



The integral appearing in the last expression can be evaluated by repeated integration by parts. 
One obtains, using the new integration variable x = cos 0, 



x(i - x 2 y 

,3 



21 









c 


dx(l 








+1 








+ 


2£ 


-1 










+i 


x 2 f 


-l 


-l 



.2\i 



+ 



21 ■ {21 - 2) 
1-3 



dxx (1 — x ) 



2\l-2 



+1 



2£- (21- 2)---2 
1 - 3- 5---(2^ - 1) y_x 
2£-(2£-2)---2 2 



2i? 



(2i)\ 



1 ■ 3 • 5 • • • {2t - 1) 2t + 1 [1 • 3 • 5 ■ ■ ■ (2£ - l)] 2 2^ + 1 
Accordingly, (5.116) implies 



I AT, 



[1 • 3 - 5 - - • (2^ - l)] s 



from which we conclude 



(5.117) 
(5.118) 



N e = {-if 1 ■ 3 ■ 5 • • • {21 - 1) (5.119) 

where the factor (— 1) has been included to agree with convention 1 . We note that (5.106, 5.114, 
5.119) provide the following expression for Y« 



(5.120) 



1 See, for example, "Classical Electrodynamics, 2nd Ed." by J.D. Jackson (John Wiley, New York, 1975) 
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We have constructed a normalized solution of (5.106), namely Yu(0, <j>), and can obtain now, through 
repeated application of (5.103, 5.110), the eigenfunctions Y£ m (6, </>), m = £—1, £ — 2, ...—^defined 
in (5.63, 5.64). Actually, we seek to determine the functions Pe m (cos6) and, therefore, use (5.103, 
5.110) together with (5.106) 



i e 



d . „ 8 



l2£ + l(£-m-l)\ 
4tt (£ + m + l)l 



P em+1 (co S 6)e^ m+1 ^ 



v / (£ + m + l)(£-m) 



l2£+l(£-m)\ 
4vr (£ + m)\ 



P em (co S 0)e im ^. 



(5.121) 



This expressions shows that the factor e m ^, indeed, describes the ^-dependence of Y£ m (9, <p); every 
application of £_ reduces the power of by one. (5.121) states then for the functions P^ m (cos6>) 



+ (m + l)cot0 P em +i(cos6) = - (£ + m + 1) {£ - m) P<? m (cos 6) . 



d_ 

The latter identity can be written, employing again the variable x = cos 6, 

-1 



P(.m{x) = 



(£ + m+l)(£-m) 



(5.122) 



(5.123) 



d 



P em+l (x) 



dx ' (1 _ x 2)| 

We want to demonstrate now that the recursion equation (5.123) leads to the expression 



Ptm{x) = 



1 (£ + m)\ 
¥l\ {£ 



my 



l-x<) 



dx 1 ' 



(x 2 - 1? 



(5.124) 



called the associated Legendre polynomials. The reader should note that the associated Legendre 
polynomials, as specified in (5.124), are real. To prove (5.124) we proceed by induction. For m = £ 
(5.124) reads 



Pu(x) = 



m 

2 e £l 



(1 - x 2 )~2 ( x 2 - 1) 



= (-1) £ 1 -3-5---(2£- 1) sm e 9 
which agrees with (5.114, 5.119). Let us then assume that (5.124) holds for m + 1, i.e., 



(x) = 



1 {£ + m+l)\ o,-m+± d l - m - x 



2 l £\ (£-m-l)\ 
Then holds, according to (5.123), 

-1 (£ + m)\ 



(5.125) 



(5.126) 



Pem(x) 



2 e £\ (£-m)l 

x (i - x 2 y^ 



a 



(1 - x z )2— + (m+1) 



m + 1 O 



dx 

:2 1 \i 



(1 -X 2 )5 



dx l ~ 



T (x 2 - If ■ 



(5.127) 
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By means of 

-('-^-^Irf 2 - 1 )' 

a£—m 

- (m+1) ^T (1 - l2r=P l^ (5J28) 

one can show that (5.127) reproduces (5.124) and, therefore, that expression (5.124) holds for m if 
it holds for m + 1. Since (5.124) holds for m = £, it holds then for all m. 
We want to test if the recursion (5.123) terminates for m = —£, i.e., if 

C-Y u {6,4>) = (5.129) 

holds. In fact, expression (5.124), which is equivalent to recursive application of yields a 
vanishing expression for m = — £ — 1 as long as £ is a non-negative integer. In that case (1 — x 2 ) 1 
is a polynomial of degree 2£ and, hence, the derivative d 2i+1 / dx 2£+1 of this expression vanishes. 

Constructing Y e _ e -> Y^+i -> ■ ■ ■ Y« 

An alternative route to construct the eigenfunctions Y^ m {6, <f>) determines first a normalized solution 
of 

C-f(0,<f>) = (5.130) 
C 3 f(0,<f>) = i£f(9,4>), (5.131) 

identifies f(0,(f>) = Yi-i{Q,4>) choosing the proper sign, and constructs then the eigenfunctions 
Yg-£ + i, Ye_£ + 2, etc. by repeated application of the operator £+. Such construction reproduces 
the eigenfunctions Yi m {9, (p) as given in (5.106, 5.124) and, therefore, appears not very interesting. 
However, from such construction emerges an important symmetry property of Y£ m (9, </>), namely, 

Y e * m (e,4>) = (-l) ro Y,_ m (M) (5-132) 

which reduces the number of spherical harmonics which need to be evaluated independently roughly 
by half ; therefore, we embark on this construction in order to prove (5.132). 
We first determine Y£-t(0,(/)) using (5.106, 5.124). It holds, using x = cos0, 

11 id 21 

P ^ X) = wim {l ~ x2) ~ 2 d^ {x2 ~ lf - (5 - 133) 

Since the term with the highest power of (x 2 — l) e is x 2£ , it holds 

^u{x 2 -lf = {2£)\ (5.134) 

and, therefore, 

Pe-e(x) = ^[(i-^) 1 • ( 5 - 135 ) 
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Due to (5.106, 5.124) one arrives at 



We note in passing that this expression and the expression (5.120) for Yfj obey the postulated 
relationship (5.132). 

Obviously, the expression (5.136) is normalized and has the proper sign, i.e., a sign consistent with 
the family of functions Y^ m constructed above. One can readily verify that this expression provides 
a solution of (5.129). This follows from the identity 

4 - icoW^-\ sin*0e-"* = 0. (5.137) 
O0 oq> ) 

One can use (5.136) to construct all other Ye m (6,(f>). According to (5.80) and (5.110) applies the 
recursion 

+ icote ^) Y em (e,4>) = y/(e + m+l)(e-m)Y em+1 (e,<j>) . (5.138) 
Employing (5.106), one can conclude for the associated Legendre polynomials 



>U — m)\fd 

' mcotO ) P( m (cos9) 



(£ + m)\ \d0 



= y/(£ + m + l)(l - m) J | - ^ + j| Ptm+iicoB 9) (5.139) 



or 



P lm+1 {cos0) = ^ - mcotO^j P lm (cos0) . (5.140) 
Introducing again the variable x = cos leads to the recursion equation [c.f. (5.122, 5.123)] 

P Em+1 (x) = ( -(l-x 2 ) 1 *-®- - m ] P tm (x). (5.141) 

We want to demonstrate now that this recursion equation leads to the expression 

( _-\\m pii+m 

For this purpose we proceed by induction, following closely the proof of eq. (5.124). We first note 
that for m = — I (5.142) yields 

p ^ x ) = w (1 " x2y 1 {x2 " lY = ¥I\ (1 " x2)l (5 ' 143) 
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wich agrees with (5.135). We then assume that (5.142) holds for m. (5.141) reads then 

Ptm+i(x) = (5.144) 

(-< i -* 2 >'!-™^) 

Replacing in (5.128) m + 1 — > — m or, equivalently, m — > — m — 1 yields 

-o-^sO-*")*!^-* 



+ m -r (1 - x 2 )t . (5.145) 



Hence, (5.144) reads 



WW = (1 " ^ |^ (- 2 " 1)' , (5-146) 

i.e., (5.142) holds for m + 1 if it holds for m. Since (5.142) holds for m = — £ we verified that it 
holds for all m. 

The construction beginning with Yg-g and continuing with Yg_g + i, Y£_^ +2 , etc. yields the same 
eigenfunctions as the previous construction beginning with Ygg and stepping down the series of 
functions Ygg-i, l«-2, etc. Accordingly, also the associated Legendre polynomials determined this 
way, i.e., given by (5.142) and by (5.124), are identical. However, one notes that application of 
(5.124) yields 



the r.h.s. of which agrees with the r.h.s. of (5.142). Hence, one can conclude 

Pg m ( X ) = (-lr^^Pl-rnix) • (5.148) 

According to (5.106) this implies for Y£ m (9,(f>) the identity (5.132). 

The Legendre Polynomials 

The functions 

Pt(x) = Peo(x) (5.149) 
are called Legendre polynomials. According to both (5.124) and (5.142) one can state 
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The first few polynomials are 

P (x) = 1 , Pi{x) = x , P 2 (x) = |(3x 2 - 1) 

P 3 (x) = § (5x 3 - 3x) . (5.151) 

Comparision of (5.150) and (5.142) allows one to express the associated Legendre polynomials in 
terms of Legendre polynomials 

am 

Pim(x) = (-l) m (l-x 2 )^—P e (x),m > (5.152) 
and, accordingly, the spherical harmonics for m > 

Q \ m 

" ' Plco+U),"" . 



d cos 9 



The spherical harmonics for m < can be obtained using (5.132). 

The Legendre polynomials arise in classical electrodynamics as the expansion coefficients of the 
electrostatic potential around a point charge q/\f\ — r*2| where q is the charge, f\ is the point where 
the potential is measured, and r*2 denotes the location of the charge. In case q = 1 and 1 7^2 1 < \ri\ 
holds the identity 2 



where 7 is the angle between f\ and r^. Using x = cos 7, t = r<ijr\, and 

|ri - r 2 | = n Vl -2xt + t 2 , (5.155) 

(5.154) can be written 

^ 00 

tu(x, t) is called a generating function of the Legendre polynomials 3 . 

The generating function allows one to derive useful properties of the Legendre polynomials. For 
example, in case of x = 1 holds 

00 

w(l,t) = , = = Y t e . (5.157) 

V ; Vl-2t + t 2 1 - t ^ V ^ 

Comparision with (5.156) yields 

P/(l) = 1 , £ = 0, 1, 2, ... (5.158) 



2 see, e.g., "Classical Electrodynamics, 2nd Ed." by J.D. Jackson (John Wiley, New York, 1975), pp. 92 

3 To prove this property see, for example, pp. 45 of "Special Functions and their Applications" by N.N. Lebedev 

(Prentice-Hall, Englewood Cliffs, N.J., 1965) which is an excellent compendium on special functions employed in 

physics. 
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For w(x, t) holds 

dlnw(x,t) _ x - t 

Qt " i _ 2xt + t 2 • (5 59j 

Using dlnw/dt = (l/w) dw/dt and multiplying (5.159) by w(l — 2xt + t 2 ) leads to the differential 
equation obeyed by w(x, t) 

(1 - 2xt + t 2 ) ^ + (t - x)w = . (5.160) 

Employing (5.156) and (d/dt) ^ Pi{x)t l = ^ £ £P£(x)^ _1 this differential equation is equivalent 
to 

oo oo 

(1 - 2xt + t 2 ) ^IP^x)^ 1 + (t - x) ^2P e (x)t e = (5.161) 

Collecting coefficients with equal powers t e yields 

= P^x) - xP (x) (5.162) 

oo 

+ ^[(^+1)%W - (2£+l)xP e (x) + eP e -i(x)]1? 
e=i 

The coefficients for all powers t e must vanish individually. Accordingly, holds 

Pi(x) = xP {x) (5.163) 
+ l)P t +i(x) = {21 + 1)1^(1) - iPt- X (x) (5.164) 

which, using Pq(x) = 1 [c.f. (5.151)] allows one to determine Pe(x) for i = 1,2,.... 

Inversion Symmetry of Yf m (9,4>) Under inversion at the origin vectors r are replaced by —r. 
If the spherical coordinates of r are (r,9,<f>), then the coordinates of — f are (r,ir — 0,ir + 4>). 
Accordingly, under inversion Yi m (9, (ft) goes over to Y^ m {TT — 9, it + 4>). Due to cos(7r — 9) = — cos 9 
the replacement r — > —r alters P^ m (x) into Pe m (— x). Inspection of (5.142) allows one to conclude 

*W-z) = (-l)' +m ^ m (x) (5.165) 

since d n /d(—x) n = (—l) n d n /dx n . Noting exp[im(7r + 4>)} = (—l) m exp(imcf)) we determine 

Y em (7T - 8, 7T + 0) = (-1) £ 5^(0, 0) • (5-166) 

Properties of Y^ m {9, (f>) 

We want to summarize the properties of the spherical harmonics Y£ m (9, <f>) derived above. 
1. The spherical harmonics are eigenfunctions of the angular momentum operators 

J 2 Y lm {9^) = h 2 £(£+l)Y em (9,cl>) (5.167) 
JzY lrn (9,<j>) = hmY em (6,<j>) . (5.168) 
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2. The spherical harmonics form an orthonormal basis of the space Cqo^) of normalizable, 
uniquely defined functions over the unit sphere S2 which are infinitely often differentiable 

f\m9d9 r ' d4Yt lm ,{p,<t>)Y tm {0,<t>) = 8 e , e 5 m , m . (5.169) 
Jo Jo 

3. The spherical harmonics form, in fact, a complete basis of C 00 (S , 2), i.e., for any f(9,4>) € 
Coo(S 2 ) holds 

00 e 

f( 9 ^) = EE C em Y em (9,4>) (5.170) 

£=0 m=-i 
rn /•2-7T 

c tm = / smOde d<f>Y; m (o,<i>)f(e,<t>) . (s.m) 

Jo Jo 



4. The spherical harmonics obey the recursion relationships 



J + Y tm (9,<l>) = hy/(e + m + l)(e- m) Y £m+1 (9, <j>) (5.172) 
J-Y em+1 (9,<f>) = hy/(£ + m + l)(e-m)Y em (9,4>) (5.173) 



where J± = J\ ± 1J2. 
5. The spherical harmonics are given by the formula 



™ - J^ti^tf si »™0x (5.74) 



9 



P^cos 9) e im * , m > 



. d cos , 
where (£ = 1,2,...) 

P (x) = 1 , Pi(s) = x , (5.175) 

P m (x) = ^i[(2£ + 1)iP<(i) " tPt-i(x)] (5-176) 

are the Legendre polynomials. The spherical harmonics for m < are given by 

Y,- m {9A) = (-l) m Y e * m (9,cP) . (5.177) 

Note that the spherical harmonics are real, except for the factor exp(im0). 

6. The spherical harmonics Y^o are 



21 + 1 

^o(M) = \l-^-P e (cos9) . (5.178) 



7. For the Legendre polynomials holds the orthogonality property 

r+1 



J dxP e (x)P(£'(x) = ^i 6 *- ( 5 - 179 ) 
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8. The spherical harmonics for 6 = are 



Y em (e = o,<p) = 5, 



mO 



2^ + 1 

4-7T 



9. The spherical harmonics obey the inversion symmetry 

Y tm (6,<f>) = (-l)'Y« m (7r-0,7r + $ 
10. The spherical harmonics for £ = 0, 1, 2 are given by 

^oo = 



47T 



(5.180) 



(5.181) 



(5.182) 



Y 



10 




sin v e 



i<fi 




COS 



(5.183) 
(5.184) 



together with (5.177). 
11. For the Laplacian holds 



Y 



22 



Y 



21 



Y20 



1 /15 
4V 2^ 

T5 




sin 2 0e 2 ^ 



sin 8 cos e*' 



8vr 

— - cos 8 

4vr 12 2 



V 2 h(r)Y tm (e,<f>) 



1 d 2 _ 1(1 + 1) 



/i(r)y €m (M) 



(5.185) 
(5.186) 
(5.187) 



(5.188) 



Exercise 5.4.1: Derive expressions (5.86), (5.87). 

Exercise 5.4.2: Derive the orthogonality property for the Legendre polynomials (5.179) using the 
generating function w(x,t) stated in (5.156). For this purpose start from the identity 



r+l 00 r+1 

/ dxw(x,t) 2 = tW ' dxP e (x)P e ,(x) , 
e,e'=o " " 1 



(5.189) 



evaluate the integral on the l.h.s., expand the result in powers of t and equate the resulting powers 
to those arising on the r.h.s. of (5.189). 

Exercise 5.4.3: Construct all spherical harmonics Ys m (8, (f>). 
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5.5 Irreducible Representations 

We will consider now the effect of the rotational transformations introduced in (5.42), 

exp ^ — J^j, on functions /(#,</>) G Coo(S^). We denote the image of a function f(9,(f>) 
under such rotational transformation by f(6,(/)), i.e., 

/(M) = expf-^./)/(0,<£). (5.190) 

Since the spherical harmonics Ye m (9,<fi) provide a complete, orthonormal basis for the function 
space Coo(S^) one can expand 

f(9,<j>) = J2 c ^Yem(e,4>) (5.191) 

£,m 

ce m = J s dQ' Y e * m (9', 0') exp ( J^j f(9', cj>') (5.192) 

One can also represent Coo(§>£) 

Coo(S>£) = [{Y e> , £ = oo, > = -£,-£+¥, ...,£}} (5.193) 

where [ { } ] denotes closure of a set by taking all possible linear combinations of the elements 
of the set. The transformations exp ^ — | $ ■ J ^ , therefore, are characterized completely if one 
specifies the transformation of any Y£ m (9, </>) 

Y tm {B,<t>) = V , ; *WM) (5.194) 

* — * L J £'m';£m 

I' ,m' 

= [ ( mY e 1 m/ (9,4>)exJ- l -S-j)Y em (9, ( />), (5.195) 

H'm';£m J $ 2 \ tl J 

i.e., if one specifies the functional form of the coefficients [^(i?)]^ m /.^ m - These coefficients can 
be considered the elements of an infinite-dimensional matrix which provides the representation of 
exp ( — 1# ■ J^j in the basis {Y £m , £ = 0, 1, . . . oo, m = -£,-£+ 1, £}. 

We like to argue that the matrix representing the rotational transformations of the function space 
Coo(Sjzf), with elements given by (5.195), assumes a particularly simple form, called the irreducible 
representation. For this purpose we consider the subspaces of Coo(S^) 

X e = [{Y e> , > = -£,-£+¥, ...,£}} ,£=r,V,*,... (5.196) 

Comparision with (5.193) shows 

oo 

Coc(S>0 = (J X t . (5.197) 

The subspaces X^ have the important property that (i) they are invariant under rotations 
exp ^ — j= •& ■ <J^j, i- e -> ex P ( — ji $ " <J^j (^Q) = X^, an d (h) they form the lowest dimensional sets X^ 
obeying Coo(S>i) = \J e X^ which have this invariance property. 
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The invariance of the subspaces X^ under rotations follows from a Taylor expansion of the expo- 
nential operator exp ^ — | ■& ■ J^j . One proceeds by expanding first ■& ■ J in terms of J + , and 

1 „ _ 1 



(5.198) 

where <&± = i?i =F #2 ■ The exponential operator exp ^ — 1 1? ■ J J can be expanded in powers 
(^#-,7+ + ^$+^7- + $3,73)"'. One employs then the commutation properties 

[J+, J-] = 2HJ 3 , [J 3 , J±] = ±hj ± (5.199) 

which follow readily from (5.56, 5.65). Application of (5.199) allows one to express 



exp 



-U-j 

n 



E, T n l T n 2 T n 3 

t -ni,n 2 ,n 3 U+ J- U3 



(5.200) 



Accordingly, one needs to consider the action of J+ 1 J™ 2 \J^ 3 on the subspaces X^. Since the Yi m are 
eigenfunctions of Jz the action of reproduces the basis functions of X^, i.e., leaves X^ invariant. 
The action of J_ on Y^ m produces Yi m -\ or ) m cas e m = —£, produces 0, i.e., J™ 2 also leaves 
X^ invariant. Similarly, one can conclude that J™ 1 leaves X^ invariant. Since any additive term 

in (5.200) leaves X^ invariant, the transformation exp ^ — | ■d ■ J^j leaves X^ invariant as well. One 

can conclude then that in the expansion (5.194) only spherical harmonics with £' = £ contribute 
or, equivalently, one concludes for the matrix elements (5.195) 



W)\v m --,t m = hi j s dnY; m ,(e,<f>)e W (-^-j )Y em (9,cf>) . 



(5.201) 



From expression (5.200) of the rotational transformations one can also conclude that the X^ are 
the smallest invariant subspaces of Coo (§><). 

If one orders the basis according to the partitoning (5.197) of Coo(S>i) the matrix representation of 
T>($) assumes a block-diagonal form, with blocks of dimension 1, 3, 5, . . ., i.e., 



1 x 1 



\ 



3x3 



2>(<?) 



(5.202) 



(2£+ 1) x (2£+ 1) 



V 



/ 
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This representation of T>{d) is the one wich has blocks of the lowest dimensions possible. The 
representation is referred to as the irreducible representation, other representations are termed 
reducible representations. 



Exercise 5.5.1: Representations of the group SO(2) 

SO(2) is the set of all 2 x 2 matrices R which are orthogonal, i.e. for which holds R T R = R R T = E, 
and for which deti? = 1. 

(a) Show that SO (2) with the group operation o defined as matrix multiplication, is a group. 

(b) Prove that the elements of SO(2) can be completely characterized through a single parameter. 

(c) Show that the map R(f) 

x' \ / cosy? — s\wp \ / x \ . . / x 

y' ) ~ \ sin^ cosy? ) \ y ) ' \y 

defines a representation of SO(2). 

(d) Show that the similarity transformation T(ip) defined in the space of non-singular, real 2x2- 
matrices O through 

O 1 = T(ip)0 = R^OR' 1 ^) 
with R(<p) as in (c) is also a representation of SO(2). 

(e) In the space C°°(F) of infinitely often differentiable, and periodic functions /(a), i.e. f(a + 
2tt) = /(a), the map p{ip) defined through 

/(a) 4 g {tp) = /(a - tp) = p(<p)f(a) 

also defines a representation of SO(2). Determine the generator of p(ip) in analogy to (5.22, 5.49). 

(f) The transformation p(ip) as defined in (e) leaves the following subspace of functions considered 
in (e) 

X rn = {/(«) = A e~ ima , i G C, > G N} 
invariant. Give the corresponding expression of p(tp). 



5.6 Wigner Rotation Matrices 

We will now take an important first step to determine the functional form of the matrix elements 
of T>{$). This step reconsiders the parametrization of rotations by the vector 1? assumed sofar. The 
three components k = 1,2,3 certainly allow one to describe any rotation around the origin. 
However, this parametrization, though seemingly natural, does not provide the simplest mathe- 
matical description of rotations. A more suitable parametrization had been suggested by Euler: 

every rotation exp ^ — J^j can be represented also uniquely by three consecutive rotations: 

(i) a first rotation around the original X3~axis by an angle a, 

(ii) a second rotation around the new a^-axis by an angle (3, 

(iii) a third rotation around the new x^'-axis by an angle 7. 
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The angles a, /?, 7 will be referred to as Euler angles. The axis x' 2 is defined in the coordinate frame 
which is related to the original frame by rotation (i), the axis X3 is defined in the coordinate frame 
which is related to the original frame by the consecutive rotations (i) and (ii). 
The Euler rotation replaces exp ^ — | -d ■ J^j by 

e -h J 3e-R0 J ae-K aj3 . (5.203) 
For any # G R F one can find Euler angles a, (3, 7 G M such that 

expf-ltf-J^ = e-Me-R^e-R " 73 (5.204) 

is satisfied. Accordingly, one can replace (5.194, 5.195) by 

yuo,4>) = Y.^ a ^^' m '-j m yue^) (5.205) 



I'm' 



[V(a,P, 7 )] em , em = I dnY e 1 ml (9,<P)e-^ J 'U-^e-^ J WU0,<P) 

Js 2 



(5.206) 



The expression (5.203) has the disadvantage that it employs rotations defined with respect to three 
different frames of reference. We will demonstrate that (5.203) can be expressed, however, in terms 
of rotations defined with respect to the original frame. For this purpose we notice that J' 2 can be 
expressed through the similarity transformation 

J' 2 = e -i aj3 J 2 es aj3 (5.207) 

which replaces J' 2 by the inverse transformation (i), i.e., the transformation from the rotated frame 
to the original frame, followed by J2 in the original frame, followed by transformation (i), i.e., the 
transformation from the original frame to the rotated frame. Obviously, the l.h.s. and the r.h.s. of 
(5.207) are equivalent. For any similarity transformation involving operators A and S holds 

e^s- 1 = SeAS- 1 . (5.208) 

Accordingly, we can write 

e -iP J 2 = e~i aj3 e-R^ 2 es aJs (5.209) 
The first two rotations in (5.203), i.e., (i) and (ii), can then be written 

e -i/^2 e -W3 = e ->j 3 e -f/3J 2 (5.210) 

The third rotation in (5.203), in analogy to (5.209), is 

e-i^ = e -s/34 e -i°Js e -i7^ JaJ 3 e iM ( 5 . 2 ll) 

Using (5.209) in this expression one obtains 

= e -R 7j 3 = e~i aj3 e~^ j2 e -B^ J 3 e kM e i aj3 (5.212) 
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Multiplication from the left with (5.210) yields the simple result 

e-i^e-iW = e~* aJ * , (5.213) 

i.e., to redefine the three rotations in (5.203) with respect to the original (unprimed) frame one 
simply needs to reverse the order of the rotations. This allows one to express the rotational 
trasnformation (5.205, 5.206) by 

Y tm (0,(f>) = J2i V ( a ^^Wm Y im(0,4>) (5-214) 

t',m' 

P(«,A7)] Wm = / dQYl lml {6A)e-^e-^e-i^Y im {0A)- 

Js 2 

(5.215) 

The evaluation of the matrix elements (5.215) benefits from the choice of Euler angles for the 
parametrization of rotational transformations. The eigenvalue property (5.64) yields 

/ dnf(e,</>)e-^s Yem (e,<f>) = [ dnf(e l( p)Y em (o,cp)e- im ^. (5.216) 

Js 2 Js 2 
Using, in addition, the self-adjointness of the operator J% one can state 

I ffiy/ m (M)e-H/(M) = e- im ' a f dQYl ml {6A) f{0A) ■ (5.217) 

Accordingly, one can write 

[V{a,f},i)\ tm ;e m > = e" W (jf dSl Y? m ,(9, cp) e"*^ *WM)) See (5.218) 
Defining the so-called Wigner rotation matrix 

€L(P) = I dnY; m ,(0,<t>)e-fr J »Y em (0,<t>) (5.219) 
one can express the rotation matrices 

PKMWw = e" W <f%JP) e~^ m S ee ■ (5.220) 

We will derive below [see Eqs. (5.309, 5.310)] an explicit expression for the Wigner rotation matrix 
(5.219). 

5.7 Spin \ and the group SU(2) 

The spin describes a basic and fascinating property of matter. Best known is the spin of the 
electron, but many other elementary components of matter are endowed with spin-like properties. 
Examples are the other five members of the lepton family to which the electron belongs, the electron 
e and the electron neutrino v e of the first generation, the muon \x and its neutrino v„ of the second 
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generation, the tau r and its neutrino v T of the third generation and their antiparticles carry a 
spin-^. So do the three generations of six quarks, two of which (in certain linear combinations) 
make up the vector mesons which carry spin 1, and three of which make up the baryons which 
carry spin-| and spin-|. There are also the mediators, the gluon, the photon, the two W^ 1 and the 
Z°, particles which mediate the strong and the electro- weak interactions, and which carry spin 1. 
The particles mentioned, e.g. the quarks, carry other spin-like properties which together, however, 
have properties beyond those of single spins. In any case, there is nothing more elementary to 
matter than the spin property. The presence of this property permeates matter also at larger scales 
than those of the elementary particles mentioned, leaving its imprint on the properties of nuclei, 
atoms and molecules; in fact, the spin of the electron is likely the most important property in 
Chemistry. We may finally mention that the spin is at the heart of many properties of condensed 
matter systems, like superconductivity and magnetism. It appears to be rather impossible for a 
Physicist not to be enamored with the spin property. We will find that the spin in its transformation 
behaviour is closely related to angular momentum states, a relationship, which might be the reason 
why consideration of rotational symmetry is so often fruitful in the study of matter. 
We will consider first only the so-called spin-^, generalizing then further below. Spin-^ systems 
can be related to two states which we denote by x+ and X-- Such systems can also assume any 
linear combination c+x+ + c -X-> c ± £ C, as long as |c+| 2 + |c_| 2 = 1. If we identify the states 
X+ and X- with the basis of a Hilbert space, in which we define the scalar product between any 
state |1) = a+x+ + a -X- and 1 2) = b + x+ + &-X- as (1|2) = a+6+ + a*_b-, then allowed 
symmetry transformations of spin states are described by 2 x 2-matrices U with complex-valued 
matrix elements Ujk- Conservation of the scalar product under symmetry transformations requires 
the property U U' = U'U = IL where U' denotes the adjoint matrix with elements \U '] . fc = U^-. 
We will specify for the transformations considered detC/ = 1. This specification implies that we 
consider transformations save for overall factors e 1 ^ since such factors are known not to affect any 
observable properties. 

The transformation matrices are then elements of the set 

SU(2) = {complex 2 x 2 matrices U; U = U ] U = 11, detf7 = 1} (5.221) 

One can show readily that this set forms a group with the groups binary operation being matrix 
multiplication. 

How can the elements of SU(2) be parametrized. As complex 2x2 matrices one needs, in principle, 
eight real numbers to specify the matrix elements, four real and four imaginary parts of Ujk,j, k = 
1,2. Because of the unitarity condition WU = IL which are really four equations in terms of real 
quantities, one for each matrix element of U^U, and because of detU = 1, there are together five 
conditions in terms of real quantities to be met by the matrix elements and, hence, the degrees of 
freedom of the matrices U are three real quantities. The important feature is that all U £ SU(2) 
can be parametrized by an exponential operator 

U = exp^-itf-S^J (5.222) 

where the vector S has three components, each component Sk,k = 1,2,3 representing a 2 x 2 
matrix. One can show that the unitarity condition requires these matrices to be hermitian, i.e. 
[Sj] mil = ([Sk] nm )*, and the condition detU = 1 requires the Sk to have vanishing trace. There 
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exist three such linear independent matrices, the simplest choice being 



^o-i, S 2 = -<r 2 , S3 = ^0-3 (5.223) 



where a k , k = 1,2,3 are the Pauli matrices 
Algebraic Properties of the Pauli Matrices 

The Pauli matrices (5.224) provide a basis in terms of which all traceless, hermitian 2 x 2-matrices 
A can be expandend. Any such A can be expressed in terms of three real parameters x, y, z 

A ={ x liy "I*")' «.*«.€«■ (5-225) 

In fact, it holds, 

A = xo\ + yoi + Z1T3 . (5.226) 

The Pauli matrices satisfy very special commutation and anti-commutation relationships. One can 
readily verify the commutation -property 

OjO k - a k (Jj = [o-j, cr fe ]_ = 2ie jk eo-£ (5.227) 

which is essentially identical to the Lie algebra (5.31) of the group S0(3). We will show below that a 
2-1-homomorphic mapping exists between SU(2) and S0(3) which establishes the close relationship 
between the two groups. 

The Pauli matrices obey the following anti- commutation properties 

<jjO k + a k o-j = [o-j, a k ] + = 25 jk lL , (5.228) 



i.e. 



a 



^ 2 = 11 ; a j a k = -a k a j for j ^ k (5.229) 



which can also be readily verified. According to this property the Pauli matrices generate a 3- 
dimensional Clifford algebra C3. Clifford algebras play an important role in the mathematical 
structure of physics, e.g. they are associated with the important fermion property of matter. At 
this point we will state a useful property, namely, 

(& ■ a) (a ■ b) = a ■ b 1 + ia ■ (o x b) (5.230) 

— * 

where a, b are vectors commuting with <r, but their comonents must not necessarily commute with 
each other, i.e., it might hold ajb k — b k aj / 0. The proof of this relation rests on the commuta- 
tion relationship and the anti-commutation relationship (5.227, 5.229) and avoids commuting the 
components aj and b k . In fact, one obtains 



(a-a)(a-b) = £j,fc=i (Jo* a.jb k 



E j= i(° j ) 2 ajbj + Z%k=i ^a k a 3 b k + Z%k=i ^a k a 3 b k . (5.231) 

j>k j<k 
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Using ((ji) 2 = 1 the first term on the r.h.s. yields a • b. The two remaining terms yield using 
oio k = —a k (ji for j / k and altering 'dummy' summation indices 

Yl%k=i ( a jh ~ a k bj) = 
j>k 

\ Ej,fc=i o 3 o k (ajb k - a k bj) - \ J2%k=i (a>jh ~ akbj) = 

j>k j>k 

\ E?,fc=i (° j ° k ~ ° k ° J ) (aj6fc " a k bj) . (5.232) 
j>k 

The commutation property (5.227) leads to 

3 3 

i ^2 e jk eai(ajb k - a k bj) = i ^ a e ( a x b) . (5.233) 

j,k=l 1=1 
j>k 

This result together with (5.231) proves (5.230). 
In the special case a = b G holds 



(a -a) 2 = a l \ . (5.234) 



5.8 Generators and Rotation Matrices of SU(2) 

Sofar there is nothing which relates the transformations U, i.e. (5.222), to rotations in space. This 
relationship emerges, however, through the algebra obeyed by the operators S k 

[S k , Se] = ie k e m S m (5.235) 

which is identical to that of the generators of SO (3). The algebra of the generators of a transforma- 
tion is such a basic property that the 'accident' that the generators of spin transformations behave 
in this respect like the generators of 3-dimensional rotations makes spins appear in their physical 
behaviour like rotations. Well almost like it, since there is a slight difference: when you interprete 
the 3-component i? in (5.222) as a rotation vector and you rotate once, let say around the X2~axis 
by 360°, spin changes sign; only a 720° rotation leaves the spin invariant. We like to derive this 
result now by evaluating the transformations of SU(2) explicitly. For this purpose we choose to 
replace (5.222) by the Euler form 

exp(— i'ySs) exp(— z/JS^) exp(— iaSs) . (5.236) 

The matrix elements of this 2x2 operator can be labelled by the basis states x+ an d X-, however, 
we like to draw in this respect also on a close analogy to angular momentum states which developes 
if one considers the operators S 2 = S 2 + Sf + £3 an d 5*3. Noticing the idempotence of the S k s 

SI - \ ( I J ) (5.237) 

one obtains S ,2 = |lL = \{ \ + 1)11 and, hence, 

S 2 X± = \{\ + l)x± ; S 3X ± = ±\x± ■ (5.238) 
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This result implies that the states x± behave in relation to the generators of SU(2) like an angular 

in relation to the generators of SO (3). We may, therefore, use the label 



momentum state 



1 2 ± \) for the states x±- 
We obtain with this notation for the transformations (5.236) 

[\m | exp(— ijSs) exp(— i(3S2) exp(— ictSs) \ \m') 



_ „-ijm 1 1 



(jffi | exp(— i/3S2) | 



\m!) e~ iam ' 



(5.239) 



where we made use of the notation (5.219). 

In order to determine the Wigner matrix in (5.219,5.239 ) one expands the exponential operator 
exp(— i^S-i). For this purpose one needs to determine the powers of —i(3S2- The idempotence 
property of Sk yields particularly simple expressions for these powers, namely, 

' §\ 2n 



x 2n 



2n+l 



(-ps 2 ) 

(-(3S 2 ) 2n+1 = (-1)" (f) 
Taylor expansion of the exponential operator yields then 



V 



exp(-i/?S 2 ) = 



OO r 

E(-!) n 

n=0 ^ 



2n 



1 + 



oo 

E(- X ) T 

n=0 



1 



-1 



2n+l 



1 



(5.240) 
(5.241) 

(5.242) 



The expressions in brackets [. . .] can be identified with the Taylor expansion of the cos- and sin- 
functions and one obtains for the rotation matrices 



cos 
s'mi 



— sin 

COS t 



= -i ; 

i.e. rotation by 360° changes the sign of the spin state. 

The complete matrix elements (5.239) in the notation (5.220) are 



We note the property of this rotation matrix 



(5.243) 



(5.244) 



_jf« + : 

! 2 P ' 



sin| e 



-singe 1 2 2 
costt e ( 2 ^2 



(5.245) 



5.9 Constructing Spin States with Larger Quantum Numbers 
Through Spinor Operators 

In this section we like to demonstrate, following Jourdan and Schwinger, that states \£m) for higher 
quantum numbers £ = 0, ^, 1, |, 2, . . ., m = —£, —£+l,...£ can be constructed formally from spin 
states x± if one considers the two properties x± to be carried by two kinds of bosons, i.e. identical 
particles any number of which can exist in the same state x+ an d X- ■ One cannot consider the 
entities carrying the spin-^ to be particles in the ordinary sense for it can be shown that spin-Tj 
particles have fermion character, i.e. no two such particles can exist in the same state. 
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Definition of Spinor Creation and Annihilation Operators 



We present the states x± through creation operators b± which when applied to a formal vacuum 
state |^o) generate %±, i.e. 



6^|*o) = x+ ; fell^o) 



x- 



(5.246) 



The corresponding adjoint operators are denoted by b + and b-. The boson character of the op- 
erators &t = (b Jf + ,b f _) T and b = (6 + ,6_) T is expressed through the commutation relationships 

(C,C = +,-) 

1 = (5.247) 



~b c ,b\, 



(5.248) 



For the vacuum state |^o) holds 

6±|*o) = (5.249) 

In the following will refer to &t = (b^ + ,b^) T and b = (b + ,b-) T as spinor creation and annihilation 
operators. These operators are associated with a given spatial reference system. We consider, 
therefore, also operators of the type 



= x + b\_ + x-b[_ ; x*b = x* + b + + x*_b- 



(5.250) 



which for x*x = x* + x + + x*_x^ = 1 represent spinor operators in an arbitrary reference system. 
For example, using (5.243) the creation operators in a coordinate system rotated by an angle (3 
around the y-axis are 



( 6 ' + )t = cos ^ 6 t + + sin ^ 6 t_ . (j/_)t = _ sin ^6t + + cos ^ 6 t_ 







P 



(5.251) 



One can show using this property and (5.247,5.248) that (b'^ and b'^ obey the commutation 
relationships 



[b> c ,b> 



C'J 



b' c ,b'l, 



b%b% 

= (5 CC / . 



(5.252) 
(5.253) 



The States |*(j,m)) 



The operators allow one to construct a set of states which represent j + m-fold and j — m-fold 
X+ and X- states as follows 



1*0',™)} = 



j+m 



Vti + m ) ! Vti ~ m )- 



1*0) . 



(5.254) 



We will show below that these states are orthonormal and form spin states with higher quantum 
numbers, i.e. j = 0, \, 1, §, . . . ; m = -j, -j + 1, . . . j. 
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5.10 Algebraic Properties of Spinor Operators 

We want to establish first a few important and useful algebraic properties which result from the 
commutation relationships (5.247, 5.248) and from (5.249). 

The Spinor Derivative Operator 

A most important operation is the action of the annihilation operators b + and 6_ on operators 
which can be expressed as polynomials, possibly infinite power series, in b + and bL. An example 

/ 4. \j+m /_ + \j-m 

is the monomial ( 6^ ' 



6* 



+ j \"-J which generates the states (5.254). We will derive the result, 
well-known for the quantum mechanical harmonic oscillator, that the annihilation operators play 
a role similar to the differential operator in calculus. 

For this purpose we consider the following polynomial of creation operators 



N 



(5.255) 



n=0 



In order to determine how this operator is modified by multiplication from the left by b+ we note 



b + f(b + ) |state) = b+, f(b + ) (state) + f(b\) b+ |state) 
In the special case | state) = |^o) this reads using (5.249) 

6+/(6|,)|*o) = \b+,f(bU] |* ). 



(5.256) 



(5.257) 



Equations (5.256) and (5.257) motivate us to determine the commutator b+, f(b\_) 
(5.255) is 



which for 



(5.258) 



Obviously, we need to evaluate b+, (j^+J ] • We will show that for this commutator holds 



b + ,(b{) n ] = n(b{) 



n-l 



(5.259) 



The property is obviously true for n = 0. If it is true for n then using (5.248) we obtain 



b+Ab 



n+l 



b+,(b 



bl+(b\ 



= n(biy-\u(biy 

= (n+l)(b\ 



+ ' 



(5.260) 
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i.e. the property holds also for n+ 1. By induction we can conclude that (5.259) holds for all n G N. 
One can finally conclude for polynomials (5.255) 



where f'{x) = Similarly, one can prove 



b-,f(bl 



f'(b 



f'(bl 



(5.261) 



(5.262) 



Because of the commutation relationships (5.247) the more general property for polynomials 
/(&LbL) in b{ and b ] _ holds (C = +,-) 



d_ 
dbt 



f(bl,bl 



(5.263) 



According to (5.257) we can conclude in particular 



b c f(b\,bl)\V ) = -^f(bl,bl)\V ). 



(5.264) 



This demonstrates the equivalence of the spinor operators b^ and the derivative operation. 
Equation (5.263) corresponds to the product rule of calculus d k f(x)g(x) = (d k f(x))g(x) + 
f(x)(d k g(x)) which can be written (d k f(x) - f{x)d k )g{x) = [d k ,f(x)\ g{x) = (d k f{x))g(x) 
or 

[d k J(x)} = O k f(x) . (5.265) 



Generating Function of the States \^f(j,m)) 
We want to prove now the property 

oo j 

exp (xtf) |vP ) = Yl <f>jm(x)V(j,m) (5.266) 

j=Q,\,l,...m=-j 

where xtf has been defined in (5.250) and where <pj m (x) represents the function of the two variables 
x + and X- closely related to \^(j,m)) 

j+m j—m 

(X) = Ift \u- TT • (5 - 267) 

exp (cc&t) l^o) is called a generating junction of |*(i, m)). 
In order to derive (5.266) we compare the terms 

j+m / , \ j— m 



ef> jm (x)y(j,m) = V 7 V. 7 |*o). (5.268) 
(j + mj! (j — my. 
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with the s-th term in the binomial expansion of (a + b) n 

Ti I 

a n - s b s . 



s\ (n — s)l 



Defining 



one obtains 



j + m = N — s 
j — m = s . 



cf) jm (x) ^ (j, m) 



x+b 



N-s 



X-b 



{N-s)\ s\ 



l*o> 



1 

m 



Nl 



_(N-s)\s\ 

Summation over s from s = to s = N yields 

N 



-y 



s=0 



Nl 



x+b\ 



N-s 



(x_&L)Vo> 



(N-s)\s\_ 
= ^(x + bl + x-bl) N \*o) 

= j^X x+b + +x - b ~T l * o) 

The summation over s can be written in terms of j and m using (5.270) 

N 2j j 

E- E - E ■ 

s=0 j—m=0 m=—j 

Change of the summation indices allows one to conclude 

j 

E 4>jm(x)y (j,m) 
m=-j 



Summing this expression over 2j = 0,1,2,..., i.e. chosing the summation index j 
leads to 



j=0 ± l,...m=-j 



E E (*) I* 0'. ^)) = E 27! (* 6t ) ' l*0> 

j=0,i,l,... 
oo 

- E ^K>»> 



it=0,l, 
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which concludes our derivation. The generating function allows one to derive various properties of 
the states |^ (j,m)) and will be used for this purpose below. 

Orthonormality of the States (j, m)) 

We want to show now that the states (5.254) are orthonormal, i.e. that 

(* (j, m) |* (/, m')> = 5 jr 5 mm , (5.276) 
holds. For this purpose we consider the inner product 

= £ <t>im(x*)<l>? m >(v){*ti,™)\*{j',™'))- (5-277) 

j,m,j',m' 

To evaluate this expression we first notice 

e( xftt )l ^ = e {x * b) . (5.278) 

which allows us to replace the l.h.s. of (5.277) by (^o |e^ x b ^e^ ybt ^o). In order to evaluate the 
operator e^ x * b ^e^ yb ^ we notice that the derivative property (5.264) implies 

6 c e( yfet )|* > = y^) |tf > • (5.279) 

One can generalize this to 

(6 c ) s e(^ t )|* ) = y^ !/6+ )|*o) • (5.280) 

The commutation properties [&+,&_] = 0, [£>+,bl] = and = allow one to state that for 

any polynomial f(b + ,b ) holds 

/(6+,6_)e(« 6t )|* ) = f(y+,y-)e(y b ')\V () ) (5.281) 

and, hence, one can write 

<e( xbt )* |e(w 6t )*o> 
= (^ \e {x * y) e( ybt )^ ) 

= (* |e( ybt )*o) (5.282) 

where we defined x*y = x* + y + + x*_y-. According to (5.249) for any non-vanishing integer s holds 

(^o\b s ^o) = (5.283) 

and, therefore, we can conclude 

(e( xb+ )*o|e( ybt )*o) = e {x ' y) (5.284) 
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and comparing (5.282) and (5.277) 

J2 hmixn^fm' (y) (V (j,m)\V {j',m')) = e (*'w> 



(5.285) 



Following steps similar to those in Eqs. (5.268-5.275) one can show 



(5.286) 



J,m 



from which follows immediately the orthonormality property (5.276). 
New Representation of Spin 0, \ , 1, |, . . . States 

We want to demonstrate now that the states |\& (j,m)) as given in (5.254) are eigenstates of oper- 
ators J 2 and J3 with eigenvalues j(j + 1) and m where J 2 and J3 are representations of the spin 
operators S 2 = S 2 + Sf + S 2 and S3 defined in 5.223, 5.224). One defines corresponding operators 
Jfc through 

' " (5.287) 



Jk = \Y. b l < ChfclC > k' > 



where a^, k = 1,2,3 denote the Pauli spin matrices (5.224). In our present notation the matrix 
elements < C l°"fc| C > f° r C = =t correspond to the matrix elements < C l^fcl C' > f° r CT = ^j- The 
operators are explicitly, using < + |o"i| + > = <— |o"i|— > = 0, < + |<7i| — > = < — |<ti| + > = 1, 
etc., 



Ji = \(b\.b- + blb+) 
h = ± fib- bib. 
J 3 = ±(blb+-blbJ) 
The three operators obey the Lie algebra of SU(2) 

[Jj, Jj] = i^ijkJk • 

This property is derived as follows 

1 



(5.288) 



(5.289) 



[Jj, Jj 



^ < n \ui\ m X n' \<Tj\ m' > &t& m , b\,b r , 



n,n' ,711,111/ 



1 



< n \u{\ m >< n \<Tj\ m! > {6^ 



b m , b\, 



n,n' ,rn,rn; 



+ b i> \bl,b m > b m } 



n,n' -nun: 
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= - ^2 < n\<Ji\m >< m\aj\m > b^bm' 

n,m,m' 

— - <n' \uj\m' ><m' \ai\m>b\ l ,b m 

n',m,m' 

= \j2< n \[ (J i^j}\m>bl l b m 

n,m 

= \ < n \ 2ie ijk<7k\ m > b^bm 

n,m,k 

= ie ijk^J2 < n \0k\m > blfim 

n,m 

= ie ijk J k . (5.290) 
|* (j, m)) as Eigenstates of J 2 and J3 

We wish to show now that the states |\I> (j,m)} are eigenstates of J 2 and J3. To this end we note 
J 2 = J 2 + Jf + J 2 = Ij + J_ + ^J-J+ + Jf . (5.291) 

Here we have defined 

J+ = Ji + iJ 2 = b ] + b_ 

J_ = J x - i.J 2 = b ] _b + . (5.292) 
The commutation relationships (5.289) yield 

[J_, J+] = [Ji - iJ 2 , Ji + iJ 2 ] = -i[J 2 , Ji] + i[Ji, H = -2 J 3 (5.293) 
from which we can conclude the property 

\j-J+ = \j+J- ~ h ■ (5.294) 

Hence, 

J 2 = J+ J_ + J 3 2 - J 3 = J 3 (J 3 - 1) + J+J- • (5.295) 
The last result together with (5.292), (5.288) yields first the expression 

J 2 = |(6 t +&+ - & t _6_)(6 t +&+ - 6 t _6_ - 2) + b ] +b-b ] -b + . (5.296) 
which results in 

J2 = |(6t + fe + 6t + 6 + + 6t +6+ 5t_6_ - 2 6t + 6 + - tf_b_tf + b + 

+ 6t_6_fet_ & _ + 2 6t_fo_ + 4frt + 6_6t_ 6+ ) (5.297) 
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The last term on the r.h.s. can be written 

&t + 6_&t_6+ = b\b + + &t+6 + &t_6_ 

= tf + b + + i&t +&+6 t_ 6 _ + l6t_ 6 _ 6 t +6 ^ 



(5.298) 



One can then state 



J 



(tf + b + ) 2 + 6 t +M t -&- + 6"L6_6 t +&+ 
+ (6"L&_) 2 + 2&t+6+ + 2&t_&. 



Defining the operator 



one can write finally 



k = \{tf + b + + 6L&_) 



J 2 = k(k + 1 ) . 



(5.299) 



(5.300) 



(5.301) 

Obviously, the states (5.254) are eigenstates of 6^+6+ and 6^_6_ with eigenvalues j + m and j — m, 
respectively, and eigenstates of k with eigenvalues j. One can then conclude 

J 2 \*(j,m)) = j(j + l)|*(j,m)) (5.302) 
J 3 |*(j,m)) = m|#(j,m)). (5.303) 

One can furthermore derive readily 

J+|*(j,m)) = + m + l)(j-m)\V {j, m + 1)) (5.304) 

J_|*(j',m)) = + m)(j-m + 1)|* (j, m - 1)) . (5.305) 



Exercise 5.10.1: The system investigated in this section is formally identical to a 2-dimensional 
isotropic harmonic oscillator governed by the Hamiltonian 

H = huiibfh + b+b 2 + 1) ; [bj,b+] = 5 jk , j = 1,2 . 

(a) Show that the eigenstates are given by 

K,n 2 ) = * (btr |0>™ (^) n2 |0> 2 
\/ni! v n 2! 

where the vacuum states are defined through 6j|0)j = 0. State the corresponding eigenvalues and 
the degree of degeneracy, i.e., the number of states to the possible energy eigenvalues. 

(b) Show that the three operators 

h = \(btb 2 + &+&i) , h = ^(6+6 2 - b+fti) , h = \(bfb 1 + 6+6 2 ) 

satisfy the Lie algebra of SU(2), i.e. [Ij,Ik] = i^jkeh- Construct, using operators I± = I± ± il 2 
and the subspace {\n\, n 2 ), n = n\ + n 2 fixed, n-i = 0, 1, 2, . . . , n} eigenstates of I 2 = I 2 +I 2 +I 2 
and of ^3 

I 2 ||A,m) = A||A,m); i3||A,m) = m\\X,m) 
where A = 0, 1 2, . . . , m = -A, -A + 1, . . . , A. Show A = rii+M(!H+n2 + y and m = i( m _ n2 )_ 
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5.11 Evaluation of the Elements d J mm ,(/3) of the Wigner Rotation 
Matrix 

The spinor algorithm allows one to derive expressions for the Wigner rotation matrix elements 
d J mm ,(P)- For this purpose we note that the states \ ^(jm)} in a rotated coordinate system according 
to (5.254) are 

, \ j+m' / i \ j—m' 



./(j + m')! VU - ™ ) ! 

where and 6 _ are given by (5.251). On the other side the states (j, m')) are related to the 
states |^ (j, to)) in the original coordinate system by 



*'(j,TO')> = E $ m ,W\*U,™))- (5-307) 



Comparision of (5.306) and (5.307) shows that the elements of the rotation matrix can be obtained 
by binomial expansion of b + and b _ in terms of b\_ and b_. For this purpose we expand 

'cosfbj. + sinf6l) J+m (-sinf^ + cosf&t V 



ES'ES'( i+ / )(«.§)*■ Hp'"" 

( ' ;™' ) (- S inf)° H) W - (6l) 23 -°'-° (5.308) 

The latter sum involves terms (b + y +m (b_y~ m for a' + a = j + m and 2j — <r' — a = j — m. 
One may expect that these two conditions restrict both a' and a. However, both conditions are 
satisfied for a' = j + to — a. The combination of a, a' values which yields {b + y +m {b^_y~ m is 
then a' = j + m — a. The prefactor of (b\y +m (tf_y~ m which according to (5.306,5.307) can be 



identified with the elements d^}.(3) of the rotation matrix can then be written 

(it fit ^ ' 



(o\ 2j—m—m' — 2a / n \ m+m'+2a 

sin|j (cob| 

In case j = 1 this expression yields, for example, 

/ |(1 + cos/3) ^sin/3 g^ -008 / 3 ) 

= -A sin/3 cos/3 A sin/3 i ( 5 - 31 °) 

y |(1 -cos/3) -^sin/3 ^(i + cos/?) 
and in case j = \ it reduces to (5.243). 
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5.12 Mapping of SU(2) onto SO(3) 



The representation (5.310) establishes a mapping of SU(2) onto SO(3). This can be shown by 

m' m> 



applying to the matrix A = (cQ. 



J in (5.310) the similarity transformation 
A = rfAU 

where U is the 3x3 unitary matrix which establishes the transformation 

ix 2 ) \ / x, \ ( 



(5.311) 




U 



1 



V 72 



V2 





\ 
1 



VI 



(5.312) 



The choice of this transformation derives from a property shown further below, namely that 
the component of the vector on the l.h.s. of (5.312) transforms like an angular momentum state 
|1 m). Hence, the matrix A should represent a rotation around the X2~axis in the space of vectors 
(xi,X2,X3) € R F . Evaluation of A yields 



A 








1 


'(: 





i 




V2 





-1 -i 










V2 


) 


1 -i 









sin/3 1 — cos/3 
2 cos/3 sin/3 
-v 7 ^ sin/3 1 + cos/3 

-2>/2 sin/3 2 cos/3 

2i 
2\/2 cos/3 2 sin/3 

-sin/3 

1 

cos/3 



(5.313) 



which, in fact, is the expected element of SO(3), i.e., the orthogonal 3x3 matrix which describes 
a rotation around the X2~axis. 

It is of interest to trace the mapping from SU(2) onto SO(3) as represented by (5.310) to the 
SU(2) transformation assumed in deriving the general result (5.309) and the particular matrix 
(5.310). The SU(2) transformation entered in (5.308) and had the form (5.251). Replacing the 
latter transformation by its negative form, i.e., 



(//. ) = -cos- b 



ft 



■ 0*t 
sin- 

2 _ 



(«-) 



= sm— 0, — cos— b 
2 + 2 



(5.314) 



leaves (5.308) unaltered except for a factor (— l) 2 - 7 which multiplies then also the final result (5.309). 
This factor implies, however, that the the representation (5.309) does not distinguish between SU(2) 
transformations (5.251) and (5.314) in case of integer j-values, e.g. in case of j = 1. One can, 
therefore, conclude that the mapping from SU(2) onto SO(3) is a 2-1 mapping. 
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Chapter 6 



Quantum Mechanical Addition of 
Angular Momenta and Spin 

In this section we consider composite systems made up of several particles, each carrying orbital 
angular momentum decribed by spherical harmonics Yi m {6, (ft) as eigenfunctions and/or spin. Often 
the socalled total angular momentum, classically speaking the sum of all angular momenta and spins 
of the composite system, is the quantity of interest, since related operators, sums of orbital angular 
momentum and of spin operators of the particles, commute with the Hamiltonian of the composite 
system and, hence, give rise to good quantum numbers. We like to illustrate this for an example 
involving particle motion. Further below we will consider composite systems involving spin states. 

Example: Three Particle Scattering 

Consider the scattering of three particles A, B, C governed by a Hamiltonian H which depends 
only on the internal coordinates of the system, e.g., on the distances between the three particles, 
but neither on the position of the center of mass of the particles nor on the overall orientation of 
the three particle system with respect to a laboratory-fixed coordinate frame. 
To specify the dependency of the Hamiltonian on the particle coordinates we start from the nine 
numbers which specify the Cartesian components of the three position vectors r^, ?b, r~c of the 
particles. Since the Hamiltonian does not depend on the position of the center of mass R = 
(that a + rnsTB + mcrc)/(mA + mB + mc), six parameters must suffice to describe the interaction 
of the system. The overall orientation of any three particle configuration can be specified by 
three parameters 1 , e.g., by a rotational vector i9. This eleminates three further parameters from 
the dependency of the Hamiltonian on the three particle configuration and one is left with three 
parameters. How should they be chosen? 

Actually there is no unique choice. We like to consider a choice which is physically most reasonable 
in a situation that the scattering proceeds such that particles A and B are bound, and particle C 
impinges on the compound AB coming from a large distance. In this case a proper choice for a 
description of interactions would be to consider the vectors tab = ?a — re and pc = (that a + 
iTiBrs) I '(tha + rns) — re and to express the Hamiltonian in terms of (tabI, \pc\-, and <(vab,Pc)- 
The rotational part of the scattering motion is described then in terms of the unit vectors tab and 

1 We remind the reader that, for example, three Eulerian angles a,/?, 7 are needed to specify a general rotational 
transformation 
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pc, each of which stands for two angles. One may consider then to describe the motion in terms 
of products of spherical harmonics Y£ imi (fA B ) Y^ 2m2 {pc) describing rotation of the compound AB 
and the orbital angular momentum of C around AB. 

One can describe the rotational degrees of freedom of the three-particle scattering process through 
the basis 

B = {Ye imi (rAB)Ye 2m2 (p c ),ti = 0,1, ■ ■ ■ ,h,max,-h <m 1 < ly, 

e 2 = o,i,...,e 2 , max ,-e 2 <mi < e 2 } (6.1) 

where £i, m ax and l2,max denote the largest orbital and rotational angular momentum values, the 
values of which are determined by the size of the interaction domain Ay, by the total energy E, 
by the masses uia, tub, mc, and by the moment of inertia Ia-b of the diatomic molecule A-B 
approximately as follows 

9 - A I 2m A m B m c E ~ 1 r— 

h V mAm B + m B mc + mAmc n> 

The dimension d{B) of B is 

£l,max £2,max 

d(B) = £ (2£i + 1) ^ (2^ 2 + 1) = (e 1>max + l)\i 2 ,max + I) 2 (6.3) 

<i=0 £ 2 =0 

For rather moderate values t\ )ma x = h,max = 10 one obtains d(B) = 14 641, a very large number. 
Such large number of dynamically coupled states would constitute a serious problem in any detailed 
description of the scattering process, in particular, since further important degrees of freedom, i.e., 
vibrations and rearrangement of the particles in reactions like AB + C — > A + BC, have not 
even be considered. The rotational symmetry of the interaction between the particles allows one, 
however, to separate the 14 641 dimensional space of rotational states Y£ imi (fAB)Ye 2m2 (pc) into 
subspaces Bk, B\ © B 2 © . . . = B such, that only states within the subspaces Bk are coupled in the 
scattering process. In fact, as we will demonstrate below, the dimensions d{Bk) of these subspaces 
does not exceed 100. Such extremely useful transformation of the problem can be achieved through 
the choice of a new basis set 

B ' = { E &iA,^^iN^(Pc), n = 1,2,. ..14 641} . (6.4) 

(2, rri2 

The basis set which provides a maximum degree of decoupling between rotational states is of great 
principle interest since the new states behave in many respects like states with the attributes 
of a single angular momentum state: to an observer the three particle system prepared in such 
states my look like a two particle system governed by a single angular momentum state. Obviously, 
composite systems behaving like elementary objects are common, albeit puzzling, and the following 
mathematical description will shed light on their ubiquitous appearence in physics, in fact, will make 
their appearence a natural consequence of the symmetry of the building blocks of matter. 
There is yet another important reason why the following section is of fundamental importance for 
the theory of the microscopic world governed by Quantum Mechanics, rather than by Classical 
Mechanics. The latter often arrives at the physical properties of composite systems by adding the 
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corresponding physical properties of the elementary components; examples are the total momentum 
or the total angular momentum of a composite object which are the sum of the (angular) momenta 
of the elementary components. Describing quantum mechanically a property of a composite object 
as a whole and relating this property to the properties of the elementary building blocks is then the 
quantum mechanical equivalent of the important operation of addittion. In this sense, the reader 
will learn in the following section how to add and subtract in the microscopic world of Quantum 
Physics, presumably a facility the reader would like to acquire with great eagerness. 



Rotational Symmetry of the Hamiltonian 

As pointed out already, the existence of a basis (6.4) which decouples rotational states is connected 
with the rotational symmetry of the Hamiltonian of the three particle system considered, i.e., 
connected with the fact that the Hamiltonian H does not depend on the overall orientation of the 
three interacting particles. Hence, rotations 9^(i9) of the wave functions ipi^AB, Pc) defined through 

9t(0) ^(tag,, fa) = il>{*r\3) 3*8, ^0) Pc) (6.5) 

do not affect the Hamiltonian. To specify this property mathematically let us denote by H' the 
Hamiltonian in the rotated frame, assuming presently that H' might, in fact, be different from H. 
It holds then H'D^i?)^ = 9t($)H^. Since this is true for any ip(rAB, Pc) it follows H'9t($) = 
9t(i?) H, from which follows in turn the well-known result that H' is related to H through the 
similarity transformation H' = *H(i9) H!SH _;L (??). The invariance of the Hamiltonian under overall 
rotations of the three particle system implies then 

H = JK^HJK -1 ^ . (6.6) 

For the following it is essential to note that H is not invariant under rotations of only tab or pc, 
but solely under simultaneous and identical rotations of tab or Pc- 

Following our description of rotations of single particle wave functions we express (6.5) according 
to (5.48) 



9t(0) = expl 1 expl 1 (6.7) 

where the generators are differential operators acting on tab (k = 1) and on pc (k = 2). For 
example, according to (5.53, 5.55) holds 

i Mi) 9 d i (2) d d 

~tJi = Z AB o VABT, 5 J3 = PyT, Px-^~ ■ (6.8) 

h dy A B ozab ft op x dp y 

Obviously, the commutation relationships 

Ji 1] ,ji 2) ] =0 for p,q = 1,2,3 (6.9) 

hold since the components of J^ 1 are differential operators with respect to different variables. One 
can equivalently express therefore (6.7) 

«H(#) = exp (6.10) 



144 Addition of Angular Momenta and Spin 

where 

1 = jW + j( 2 ) . (6.11) 

By means of (6.11) we can write the condition (6.6) for rotational invariance of the Hamiltonian in 
the form 

H = exp^-^-f) Hexp^+^-j) . (6.12) 
We consider this equation for infinitesimal rotations, i.e. for <C 1. To order 0(|i?|) one obtains 

Since this holds for any i? it follows HJ - JH = or, componentwise, 

[H, J fc ] = , k = 1,2,3 . (6.14) 

We will refer in the following to Jh., k = 1,2,3 as the three components of the total angular 
momentum operator. 

The property (6.14) implies that the total angular momentum is conserved during the scattering 
process, i.e., that energy, and the eigenvalues of J 2 and J3 are good quantum numbers. To describe 
the scattering process of AB + C most concisely one seeks eigenstates 2) jm of J 2 and J3 which can 
be expressed in terms of Y^ mi {f AB ) Y e2m2 (p c ). 



Definition of Total Angular Momentum States 

The commutation property (6.14) implies that the components of the total angular momentum 
operator (6.12) each individually can have simultaneous eigenstates with the Hamiltonian. We 
suspect, of course, that the components JT^, k = 1, 2, 3 cannot have simultaneous eigenstates among 
each other, a supposition which can be tested through the commutation properties of these opera- 
tors. One can show readily that the commutation relationships 

[hJi] = ifttktrJm (6.15) 

are satisfied, i.e., the operators Sk, fc = 1,2,3 do not commute. For a proof one uses (6.9), the 
properties [jj:^ , = ifoekimJm 1 for n = 1,2 together with the property [A, B + C] = 

[A,B] + [A,C]. 

We recognize, however, the important fact that the obey the Lie algebra of SO(3). According 
to the theorem above this property implies that one can construct eigenstates 2) jm of JJ3 and of 

f = Sj + f 2 + f 3 (6.16) 

following the procedure stated in the theorem above [c.f. Eqs. (5.71-5.81)]. In fact, we will find 
that the states yield the basis B' with the desired property of a maximal uncoupling of rotational 
states. 

Before we apply the procedure (5.71-5.81) we want to consider the relationship between 2)jm and 
^imi^Afl) Ye 2m2 (pc)- I n t ne following we will use the notation 



^1 = TAB , ^2 = PC 



(6.17) 
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6.1 Clebsch-Gordan Coefficients 

In order to determine 2)jm we notice that the states Y^ imi (f2i) Yg 2ril2 (Q 2 ) are characterized by four 
quantum numbers corresponding to eigenvalues of [J^] , J$~\ [J^] , and . Since 2)jm 
sofar specifies solely two quantum numbers, two further quantum numbers need to be specified 
for a complete characterization of the total angular momentum states. The two missing quantum 
numbers are i\ and £ 2 corresponding to the eigenvalues of [.y^] and [j^ 2 )] . We, therefore, 
assume the expansion 

2) jm {h,t2 1 ni,n 2 ) = Yl (JM\hm 1 e 2 m 2 )Y eirni (n 1 )Y e2m2 (n 2 ) (e.is) 

mi,m2 

where the states 2)jm(^i> ^2|^i> Pc) are normalized. The expansion coefficients (J M\£\ mi £ 2 m 2 ) 
are called the Clebsch-Gordan coefficients which we seek to determine now. These coefficients, 
or the closely related Wigner 3j- coefficients introduced further below, play a cardinal role in the 
mathematical description of microscopic physical systems. Equivalent coefficients exist for other 
symmetry properties of multi-component systems, an important example being the symmetry 
groups SU(N) governing elementary particles made up of two quarks, i.e., mesons, and three quarks, 
i.e., baryons. 

Exercise 6.1.1: Show that J 2 , JT 3 , {j {l) f , (J i2) ) 2 , and jW-jW commute. Why can states 2) JM 
then not be specified by 5 quantum numbers? 



Properties of Clebsch-Gordan Coefficients 

A few important properties of Clebsch-Gordan coefficients can be derived rather easily. We first 
notice that 2)jm in (6.18) is an eigenfunction of J3, the eigenvalue being specified by the quantum 
number M, i.e. 

J 3 2)jm = hMtyjM ■ (6.19) 

Noting JJ3 = + and applying this to the l.h.s. of (6.18) yields using the property 

J^ k) Y ikmk (n k ) = hm k Y Ekrnk (n k ) , k = 1,2 

Y (mi + m 2 )(JM\£imi£ 2 m 2 )Y hmi (ni)Y e2m2 (n 2 ) . (6.20) 

mi,m2 

This equation can be satisfied only if the Clebsch-Gordan coefficients satisfy 

(J M\£imi£ 2 m 2 ) = for m x + m 2 / M . (6.21) 
One can, hence, restrict the sum in (6.18) to avoid summation of vanishing terms 

VMliMtouSh) = ^(J M\hmi£ 2 M - mi)Y eimi (ni)Y e2m2 (n 2 ) . (6.22) 

mi 

We will not adopt such explicit summation since it leads to cumbersum notation. However, the 
reader should always keep in mind that conditions equivalent to (6.21) hold. 
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The expansion (6.18) constitutes a change of an orthonormal basis 

®(4,4) = {Y eimi (n 1 )Y e2m2 (n 2 ),m 1 = -4, -4 + 1, . . . ,4 , 

m 2 = -4,-4 + 1,.-., 4} , (6.23) 

corresponding to the r.h.s., to a new basis 23' (4, 4) corresponding to the l.h.s. The orthonormality 
property implies 

J dfti J dU 2 Ye imi (fti) Y^ 2m2 {Vt2)Y^ rn ' i Qi)Y e / 2m > 2 (Q 2 ) = ^i x t'^ mi m'^i 2 t'^>m2m' 2 - (6.24) 

The basis 05(4,4) has (24 + 1)(24 + 1) elements. The basis 53' (4, 4) is also orthonormal 2 and 
must have the same number of elements. For each quantum number J there should be 2 J + 1 
elements 2)jm with M = — J, — J + 1, . . . , J. However, it is not immediately obvious what the J- 
values are. Since 2)jm represents the total angular momentum state and Y^ imi (f2i) and Yi 2m2 {Q, 2 ) 
the individual angular momenta one may start from one's classical notion that these states represent 
angular momentum vectors J, JW and J^ 2 \ respectively. In this case the range of | J|-values would 

be the interval [ \J^\ — \ J <y2 ^\ , \J^\ + |<^ 2 ^|]. This obviously corresponds quantum mechanically 

to a range of J-values J = |4 — 4|, |4 — 4| + 1, • • • 4 + 4- In fact, it holds 

(2J + 1) = (24 + 1) (24 + 1) , (6.25) 

J=\h-i 2 \ 

i.e., the basis 03'(4, 4) should be 

® 2 = {2)jm(4, 41^1,^2); J = |4-4|, |4-4| + 1,4 + 4, 

M = —J, —J + 1, . . . , J} . (6.26) 

We will show below in an explicit construction of the Clebsch-Gordan coefficients that, in fact, the 
range of values assumed for J is correct. Our derivation below will also yield real values for the 
Clebsch-Gordan coefficients. 



Exercise 6.1.2: Prove Eq. (6.25) 



We want to state now two summation conditions which follow from the orthonormality of the two 
basis sets 03(4,4) and 23'(4,4)- The property 

J dfii J dfWjA/(4, 41^2) 9 j'm'(4, 4|^i,tt 2 ) = 5jj,8 MM ' (6.27) 

together with (6.18) applied to 2)} M and to 2J jim' and with (6.24) yields 

Y (JM|4mi4m 2 )*(J'M'|4mi4m 2 ) = S JJ' S mm> ■ (6.28) 

m\,m2 



2 This property follows from the fact that the basis elements are eigenstates of hermitian operators with different 
eigenvalues, and that the states can be normalized. 
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The second summation condition starts from the fact that the basis sets ^B(£i,£ z ) and 25'(^i,^ 2 ) 
span the same function space. Hence, it is possible to expand Y^ mi {yi-\)Y^ 2m2 {yi 2 ) in terms of 
Z)jm(£i, ^2|^i,^2), i.e., 

y* imi (ni)iWn 2 ) = Y Y c J>M'% J >M>(£i,£2\n 1 ,n 2 ) , (6.29) 

J'=\l x -t 2 \ M'=-J 

where the expansion coefficients are given by the respective scalar products in function space 

cj, M > = J J dn 2 ^j l M'(h,h\n 1 ,n 2 )Y eimi (n 1 )Y e2m2 (n 2 ) . (6.30) 

The latter property follows from multiplying (6.18) by %)*j, M ,(£i, ^2^1,^2) and integrating. The 
orthogonality property (6.27) yields 

Sjj'Smm' = ^ (JM|^imif 2 m2)cj'M' • (6.31) 

mi,m2 

Comparision with (6.28) allows one to conclude that the coefficients cj'M' are identical to 
(J' M'\£ 1 m 1 £ 2 m 2 )*, i.e., 

^imi(^l)^ 2 m 2 (^2) 

= Y Y ( J> M '\ti m i £2 m 2 )*Z) j, M '(£i, £2^1,^2) , (6.32) 

J'=|^-^ 2 | M'=-J 

which complements (6.18). One can show readily using the same reasoning as applied in the 
derivation of (6.28) from (6.18) that the Clebsch-Gordan coefficients obey the second summation 
condition 

YiJMlhrmhmtfiJMlhm^hm'J = t>„^„> 2 . (6.33) 

JM 

The latter summation has not been restricted explicitly to allowed J- values, rather the convention 
{J M\£ l m l £ 2 m 2 ) = if J < \h - £ 2 \ , or J > h + £ 2 (6.34) 

has been assumed. 

6.2 Construction of Clebsch-Gordan Coefficients 

We will now construct the Clebsch-Gordan coefficients. The result of this construction will include 
all the properties previewed above. At this point we like to stress that the construction will be based 
on the theorems (5.71-5.81) stated above, i.e., will be based solely on the commutation properties of 
the operators J and J^ k \ We can, therefore, also apply the results, and actually also the properties 
of Clebsch-Gordan coefficients stated above, to composite systems involving spin-^ states. A similar 
construction will also be applied to composite systems governed by other symmetry groups, e.g., 
the group SU(3) in case of meson multiplets involving two quarks, or baryons multiplets involving 
three quarks. 
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For the construction of 2) jm we will need the operators 

J± = Ji + 02 • (6.35) 

The construction assumes a particular choice of J G {|£i — ^1, |^i — ^2| + 1, • ■ ■ ^i + £2} and for 
such J-value seeks an expansion (6.18) which satisfies 

UVjj(h,£2\n 1 ,n 2 ) = (6.36) 

h%jj(tiMtoiM = hJ% J j(e 1 ,£ 2 \n 1 ,n 2 ) . (6.37) 

The solution needs to be normalized. Having determined such 2J jj we then construct the whole 
family of functions Xj = {2)jm(4>^|^i,^2), M = —J, — J + 1, ... J} by applying repeatedly 

S-^JM+lih,^!,^) = Hy/(J + M + 1)(J - MWjmWu £2piM ■ (6.38) 
for M = J — 1, J — 2, ... , —J. 

We embark on the suggested construction for the choice J = £\ + £ 2 . We first seek an unnormalized 
solution Qjj and later normalize. To find Qjj we start from the observation that Qjj represents the 
state with the largest possible quantum number J = £\ + £ 2 with the largest possible component 
M = l\ + t 2 along the z-axis. The corresponding classical total angular momentum vector Jclass 
would be obtained by aligning both J^ ss and J^ss a ^ so along the z-axis and adding these two 
vectors. Quantum mechanically this corresponds to a state 

Gt 1+ t a A+M^2piM = 5Wfii)lWft2) (6.39) 

which we will try for a solution of (6.37). For this purpose we insert (6.39) into (6.37) and replace 
according to (6.11) J+ by + j[ 2) . We obtain using (5.66,5.68) 

+ 4 2) ) ^(^1)^(^2) (6.40) 

= (4 1)y wi( n i)) y W n 2) + y Vl (^i)(4 2) ^ 2 (^)) = . 

Similarly, we can demonstrate condition (6.25) using (6.11) and (5.64) 

+ 4 2) ) 5Wfii)iW«2) 
= (^ 3 (1) *W^)) *W^) + y Vl (fii)(j3 (2) y 2 , 2 (^ 2 )) 
= M*i + ^2)^1(^1)^(^2) . (6.41) 

In fact, we can also demonstrate using (??) that Qe 1+ £ 2i e 1+ t 2 (£i,£ 2 \Qi, Q 2 ) is normalized 

J dUi J dO, 2 ge 1 +e 2 /i+e 2 (h, £2^1,^2) 

= (^J d^Yw^fj dSl 2 Y l2la (Sl 2 )) = 1. (6.42) 

We, therefore, have shown 

^i+^A+^^i.^lfii,^) = iwni)iwn 2 ). (6.43) 
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We now employ property (6.38) to construct the family of functions B^ + £ K = 
{Vlv+e* m(4k, 4^f, 5^), M = -(4k + 4^), . . . , (4k + 40}- We demonstrate the procedure ex- 
plicitly only for M = 4+42-1- The r.h.s. of (6.38) yields with J_ = + jl^ the ex- 
pression_^v/2ZI Y txlx _ x (Oi)Y^ 2 (0 2 ) + ft\/2l 2 ~Y Wl _i(ni)Y^ 2 _i(n 2 )- The l.h.s. of (6.38) yields 
ft^2(4 + 4^i+^i +£ 2 -i(^i> 4|^i,^2)- One obtains then 

2)^ + ^ €l+€2 _ 1 (4,42|^i,^2) = (6.44) 

y^y^-i^i)^^) + y^y^i^i)^-!^) . 

This construction can be continued to obtain all 2(4 + 4) + 1 elements of B^ + ^ and, thereby, 
all the Clebsch-Gordan coefficients (£1 + l2M\£\m\l2rri2) . We have provided in Table 1 the explicit 
form of 2)jm(44|^i^2) for £\ = 2 and 4 = 1 to illustrate the construction. The reader should 
familiarize himself with the entries of the Table, in particular, with the symmetry pattern and with 
the terms Y£ imi Yz 2m2 contributing to each 2)jm- 

We like to construct now the family of total angular momentum functions E^ F+ ^_^ = 
{2)^+^-fm(4k, 4^f, M M = -(4k + 4* - ¥), . . . , (4k + 4* - J^)}. We seek for this pur- 
pose first an unnormalized solution ^ 1+ £ 2 _ 1 ^ 1+ ^ 2 _ 1 of (6.36, 6.37). According to the condition 
(6.21) we set 

ge 1+ t a -u 1+ t2-i(W^2) = ^ 1 -i(0 1 )y^ 2 (n 2 ) + cY^n^Y^^) (6.45) 

for some unknown constant c. One can readily show that (6.37) is satisfied. To demonstrate (6.36) 
we proceed as above and obtain 

+ ciw^i) (4 2) ^ 2 -i(^)) 

= (^24 + c^lth) Y lltl {Sli)Y l2l2 (Sl 2 ) = . (6.46) 

To satisfy this equation one needs to choose c = — \f£\f£~2- We have thereby determined 
Gi 1+ £ 2 -ie 1+ £ 2 ~i in (6.45). Normalization yields furthermore 

Ve 1+ e 2 -u 1+e2 -i(£i£2\^2) (6.47) 
^-y, 1 , 1 _ 1 (n 1 )y, 2 , 2 (n 2 ) - J jJL-YtMSlJYe^Slt) . 

This expression is orthogonal to (6.39) as required by (6.27). 

Expression (6.47) can serve now to obtain recursively the elements of the family B^ + ^_jk for 
M = £\ + 42 — 2, 4 + 42 — 3, . . . , — (4 + £2 — 1). Having constructed this family we have determined 
the Clebsch-Gordan coefficients (4 + 4 — lM|4"Zi4"i2)- The result is illustrated for the case 
4 = 2, 4 = 1 in Table 1. 

One can obviously continue the construction outlined to determine ^}ei+& 2 -2e 1 +e 2 -2, etc. and all 
total angular momentum functions for a given choice of 4 and 4- 



Exercise 6.2.1: Construct following the procedure above the three functions 2) jm(4> 4|^i> ^2) 
for M = 4 + £2 - 2 and J = 4 + 4,4 + £2 - 1, 4 + £2 - 2. Show that the resulting functions 
are orthonormal. 
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i22(Oi)Yn(0 2 ) 






2)33(2,110!, 02) 


1 








>2i(Oi)Fii(0 2 ) 


^22(Ol)Fl (O 2 ) 




2) 32 (2,l|Oi,02) 


J\ ~ 0.816497 


W± ~ 0.57735 




2)22(2, l|Oi,^2) 


-^J ~ -0.57735 


y§ ~ 0.816497 






i2o(Oi)Fn(0 2 ) 


i2i(Oi)Fi (O 2 ) 


l22(Ol)Fl_l(0 2 ) 


m /OIID DO"! 

-4J3ll^, l|i'l, i'^J 


a / ^ ~ U.QOZ40D 


/ 8 ^ n 7QH007 

y — U . 1 oUZy f 


/l n o^q 1 on 

a / j£ — u.zooiyy 


2) 2 i(2,l|Oi,02) 


~ -0.707107 


i/i ~ 0.408248 
V ° 


^/l ~ 0.57735 


2)n(2,l|Oi,02) 


~ 0.316228 


- vS - -°- 547723 


y§ ~ 0.774597 




F 2 -i(Oi)yii(o 2 ) 


^o(0i)yio(o 2 ) 


y 2 i(Oi)Fi_i(o 2 ) 


2)30(2,11^,02) 


A A ~ 0.447214 

v 5 


y — 

J| ~ 0.774597 


J\ ~ 0.447214 

V <> 


2)20(2,110!, 02) 


-y^ ~ -0.707107 





y| ~ 0.707107 


2)i (2,l|Oi,O2) 


~ 0.547723 


-^f ~ -0.632456 


~ 0.547723 




F2-2(Oi)Fii(0 2 ) 


y 2 -i(Oi)Yio(0 2 ) 


F 2 o(Oi)Fi-i(0 2 ) 


2)3-1(2,110!, 02) 


^ ~ 0.258199 


^ ~ 0.730297 


y/| ~ 0.632456 


2)2-i(2, l|Oi, 02) 


-yj\ - -°- 57735 


-\fi ^ -0.408248 
V ° 


^1 ~ 0.707107 


2)i_i(2,l|Oi,02) 


y§ ~ 0.774597 


- -0-547723 


~ 0.316228 






y 2 _ 2 (Oi)Yi (O 2 ) 


Y2_i(ni)Yi_i(n 2 ) 


2)3-2(2,110^02) 




^/i ~ 0.57735 


^/| ~ 0.816497 


2)2-2(2,110!, 02) 




~yi - -0-816497 


yj\ ~ 0.57735 








^ 2 - 2 (Oi)yi-i(o 2 ) 


2)3-3(2, l|Oi, 02) 






1 



Table 6.1: Some explicit analytical and numerical values of Clebsch-Gordan coefficients and their 
relationship to the total angular momentum wave functions and single particle angular momentum 
wave functions. 
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Exercise 6.2.2: Use the construction for Clebsch-Gordan coefficients above to prove the following 
formulas 



{J,M\i,m-\,\,+\) 
(J,M\£,m +§,§,-§} 



J+M 
2J 



J-M+l 
2.J+2 



for I = J - \ 
for I = J + \ 



J-M 
2.J 



for t = J 



The construction described provides a very cumbersome route to the analytical and numerical 
values of the Clebsch-Gordan coefficients. It is actually possible to state explicit expressions for 
any single coefficient ( J M\l\m\l2rri2) ■ These expressions will be derived now. 

6.3 Explicit Expression for the Clebsch-Gordan Coefficients 

We want to establish in this Section an explicit expression for the Clebsch-Gordan coefficients 
{J M\t\m\l2m2) . For this purpsose we will employ the spinor operators introduced in Sections 5.9, 
5.10. 

Definition of Spinor Operators for Two Particles 

In contrast to Sections 5.9, 5.10 where we studied single particle angular momentum and spin, we 
are dealing now with two particles carrying angular momentum or spin. Accordingly, we extent 
definition (5.287) to two particles 



{1) Jk = 5Ec,c a c <CkfclC > «c 



(6.48) 
(6.49) 



where (, £' = ± and the matrix elements < Q \a^\ (' > are as defined in Section 5.10. The creation 
and annihilation operators are again of the boson type with commutation properties 

[d£, ct£/] = a£, =0 , a^, a^, = S^i (6.50) 

[6 C , V] = [b\, b\] = , [& f , &£,] = 6 CC . (6.51) 

The operators a^, and b^, refer to different particles and, hence, commute with each other 



[o c , b c 



a\, b\, 



a f , bl, 



. 



(6.52) 



According to Section 5.10 [cf. (5.254)] the angular momentum / spin eigenstates \£iirii)i and 1^2^2)2 
of the two particles are 

(nxY i+mi r„tV i_mi 

l^o) (6-53) 



\hmi)i 
\hm 2 )2 



y/(ii+mi)\ V(^i- m i) ! 

t v l 2 +m 2 I t \ t 2 -m 2 



I*0> • 



(6.54) 
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all [all bl 



It holds, in analogy to Eqs. (5.302, 5.303), 

Wj 2 \£ 1 m 1 ) 1 = £ 1 (£ 1 + l)\£ 1 m 1 ) 1 , ^Mhm^r = m 1 \£ 1 m 1 ) 1 (6.55) 
Wj 2 \l 2 m 2 )2 = ^2(^2 + 1)1^2)2, {2) ^2^2)2 = m 2 \i 2 m 2 )2 ■ (6.56) 

The states \£i,mi)i\£ 2 ,m 2 } 2 , which describe a two particle system according to (6.53, 6.54), are 
\h,mi)i \h,m 2 ) 2 = (6.57) 

_A L I* \ 

y/(h + mi)! y/(h - mi)! ^ + m 2 )\ V ^ - m 2 )! 

The operator of the total angular momentum/spin of the two particle system is 

1 = W J + ( 2 ) J (6.58) 

with Cartesian components 

Jfc = + {2) «/fe ; fc = 1,2,3 . (6.59) 

We seek to determine states \J, M(£±,£ 2 )) which are simultaneous eigenstates of the operators 
J 2 , JJ3 , ^ J , ( 2 )J which, as usual are denoted by their respective quantum numbers J, M, £i,£ 2 , 
i.e., for such states should hold 

S 2 \J,M(£ U £ 2 )) = J(J + 1) I J, M(£i,£ 2 )) (6.60) 

h\J,M{h,i 2 )) = M\J,M(£ U £ 2 )) (6.61) 

^J 2 \J,M{h,£ 2 )) = + l)\J,M(£ u e 2 )) (6.62) 

Wj 2 \J,M(£ u £ 2 )) = £ 2 {£ 2 + l)\J,M{£ l ,£ 2 )) . (6.63) 

At this point, we like to recall for future reference that the operators Wj 2 , ( 2 )j 2 , according to 
(5.300), can be expressed in terms of the number operators 

fci = \(a+a+ + aLa_) , k 2 = X - (b\b+ + &H_ ) , (6.64) 

namely, 

0')j 2 = kj {kj + 1) , j = 1, 2 . (6.65) 

For the operators &j holds 

kj\£j,mj)j = ^-Ifj.m^j (6.66) 

and, hence, 

kj\J,M{£ u £ 2 )) = tj\J,M{l u l 2 )) (6.67) 

We will also require below the raising and lowering operators associated with the total angular 
momentum operator (6.59) 

J± = Ji ± ifo ■ (6.68) 
The states \J, M(£i,£ 2 )} can be expressed in terms of Clebsch-Gordan coefficients (6.18) as follows 
\J,M(h,£ 2 )} = E mi , m2 l4,mi)i|^,m 2 ) 2 (£i,mi,^,m 2 |J,M(^i,£ 2 )) , 

\£i - £ 2 \ < J < h + £ 2 , -J < M < J . (6.69) 

The aim of the present Section is to determine closed expressions for the Clebsch-Gordan coefficents 
{£ x ,m x ,£ 2 ,m 2 \J,M{£ x ,£ 2 ))- 
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The Operator if* 

The following operator 



ift 



albl 



atbl 



(6.70) 



will play a crucial role in the evaluation of the Clebsch-Gordan-Coefficients. This operator obeys the 
following commutation relationships with the other pertinent angular momentum / spin operators 



L, ift 



2 



i = i, 2 

3 



[Wj 2 , /ft] = K^kj + - ift , j = 1, 2 

[J 3 , ^] = o 
[J±, K^] = . 

We note that, due to JT 2 = ^J+J_ + \j-J+ + J 2 , the relationships (6.73, 6.74) imply 

J 2 , if 1 "! = . 



(6.71) 

(6.72) 

(6.73) 
(6.74) 

(6.75) 



The relationships (6.71-6.73) can be readily proven. For example, using (6.64, 6.50, 6.51) one 
obtains 



k, if f 



a} + a+ + a_a-, a' + b_ — aLb\_ 



a+, a 



bl - -al 



a_, a 



A similar calculation yields [A^ift] = 
[OV 2 , ift] = 



|ift. Employing (6.65) and (6.71) one can show 
%(fc J + l),if^ 

fc* + 1, ift] + - if f l (kj + 1) 



= - kj 



2 L 



+ Iift(% + 1) 



ifU, + 



ift 



+ -ift 

2 



= if t jfe + - if t . 
4 

Using J 3 = {1) J 3 + (2) J 3 , expressing ^ J3 through the creation and annihilation operators accord- 
ing to (5.288), and applying the relationships (6.71-6.73) yields 



1 

2 



a+, a 



aLa_ + 



blb_, albl 



albl 



b+,b 



+ 



a_, a 



= . 
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Starting from (5.292) one can derive similarly 



ala_ 



bib-, a\b 



albl 



a_, at 



bt + bUl 



b-, bl 



The property [JT_ , /ft] = is demonstrated in an analoguous way. 
Action of ift on the states |J, M(£i,£ 2 )) 

We want to demonstrate now that the action of on the states |J, M(£i,.£ 2 )) produces again 
total angular momentum eigenstates to the same J and M quantum numbers of J 2 and J3, but for 
different l\ and £2 quantum numbers of the operators ^' J 2 and ( 2 >J 2 . 

The commutation properties (6.73, 6.75) ascertain that under the action of the states 
\J, M(£i,£2)) remain eigenstates of J 2 and S3 with the same quantum numbers. To demonstrate 
that the resulting states are eigenstates of ^ J 2 and ^ J 2 we exploit (6.72) and (6.62, 6.63, 6.67) 

Wj 2 ift|J,M(^,£ 2 )) = Q^J 2 , ift] + KtWj 2 ) \J,M{l u l 2 )) 

= fift% + j^t + Kt^.(^. + 1 )^ |j,M(^^ 2 )) 

= ^ f + 1 + M^j + l)) |J,M(^i,£ 2 )) 



& + 



& + 



K*\J,M(£ 1 ,£ 2 )} 



However, this result implies that K t| J, M(£\, £2)) is a state with quantum numbers £\ + \ and 
^ 2 + |, i.e., it holds 

K^\J,M{£ U £ 2 )) = N\J,M{l l + ^l 2 + 1 -)) . (6.76) 



Here iV is an unknown normalization constant. 
One can generalize property (6.76) and state 



tftj \J,M(e u e 2 )) = N'\J,M(£ 1 + -,£2 + -)) 



(6.77) 



where N' is another normalization constant. We consider now the case that (ift) n acts on the 
simplest total angular momentum / spin state, namely, on the state 



\ji+h,3i+h(h,h)) = \ji,3i)i \h, 32)2 



(6.78) 



a state which has been used already in the construction of Clebsch-Gordan coefficients in Section 6.2. 
Application of (K<) n to this state yields, according to (6.77), 



\3i+ 32,31 + 32 (31 + ^,32 + ^)) 



(6.79) 



= N(n,jil + -, ji + -) (K ) \h+ 32,h+ 32(3i,h)) 
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where we denoted the associated normalization constant by N{n,j\ + §, J2 + §)• 

It is now important to notice that any state of the type \J, J(£i, £2)) can be expressed through the 

r.h.s. of (6.79). For this purpose one needs to choose in (6.79) n,ji,j 2 as follows 



n 



J = ji + h , h = ji + 2 > ^2 



n 

J2 + 2 



which is equivalent to 



Ji 

J2 



= £1 + £2 - J 

n 1 

2 2 

n 1 

2 ~ 2 



£2 



(J + h 
{J + £1 



Accodingly, holds 



\J,J(£ 1 ,£ 2 )) = N(£ 1 +£ 2 - J,h,£ 2 ) (^) 



x \ 1 -{J + h - £ 2 ),\(J + h 
x \\{J + h - h)\{J + £2 

The normalization constant appearing here is actually 



£2))l X 
h))2 ■ 



(2J + 1)! 

_{h + £ 2 - J) \ (£1 + £ 2 + J + 1)1 



n(£ 1 + £ 2 - j,e u e 2 ) 

The derivation of this expression will be provided further below (see page 158 ff). 



(6.80) 



(6.81) 



(6.82) 



(6.83) 



Strategy for Generating the States \ J, M(£i,£ 2 )) 

Our construction of the states \ J, M{£\,£2)) exploits the expression (6.82) for \J, J(£i,£ 2 ))- The 
latter states, in analogy to the construction (5.104, 5.105) of the spherical harmonics, allow one to 
obtain the states |J, M{£\,£2)) for — J < M < J as follows 



\J,M(£ 1 ,£ 2 )} 
A(J,M) 



A(J,M) (J_) J - M \J,J(£i,£ 2 )) 

(J + M)\ ]5 
_(2J)!(J - M)\_ ' 



(6.84) 
(6.85) 



Combining (6.84) with (6.82, 6.57) and exploiting the fact that J_ and commute [c.f. (6.74)] 
yields 



|J,M(^ 2 )> = Nj£ 1+ £2-JMA2)^M) 
V(J + h-£2y.(J + £2-£iV. 



(6.86) 



h+e 2 -J 



(J-)* 



l*o) 
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Our strategy for the evaluation of the Clebsch-Gordan-coefficients is to expand (6.86) in terms of 
monomials 

/ + \£i+m 1 / . \<i-mi / . \i 2 +m 2 / . \£ 2 -m 2 

(4) («-) K) (bl) l*o>, (6-87) 

i.e., in terms of \£\, mi)i|^2, "22)2 [cf. (6.57)]. Comparision with (6.69) yields then the Clebsch- 
Gordan-coefncients. 

Expansion of an Intermediate State 

We first consider the expansion of the following factor appearing in (6.86) 

\G r st ) = f.(«i) S (6l)V») (6-88) 
in terms of monomials (6.87). For this purpose we introduce the generating function 

I(X,x,y) = exp(AJ_) exp (xa\_^ exp(xb\_^J |* ) . (6.89) 
Taylor expansion of the two exponential operators yields immediately 

T ^y) = E^f (6 - 90) 

r,s,t 

i.e., I(X,x,y) is a generating function for the states \G r st ) defined in (6.88). 

The desired expansion of \G r st ) can be obtained from an alternate evaluation of I(X,x,y) which is 
based on the properties 

a c /(<4)|* > = -^/( a t)|¥ ) &</(&[) |* > = A/(4)|* ) (6.91) 

which, in analogy to (5.264), follows from the commutation properties (6.50-6.52). One obtains 
then using 

J_ = a ] _a+ + bl b + (6.92) 
and noting [a+, aL] = [b+, bl] = [cf. (6.50)] 



exp ( AJ_ ) f(a\.) g(bl) = exp ( ala + ) f(a\.) exp ( tfb+ ) g(b\_) 



x 



A6 



t ^ 



?^(4j «<*>'••> 

/(4 + A at) g(b\. + Xbl) |* ) . (6.93) 
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We conclude 

I(X,x,y) = exp ( a\ + A at ) exp(oJ_ + A fel ) |* ) . (6.94) 

One can infer from this result the desired expressions for \G r st ). Expanding the exponentials in 
(6.94) yields 

i(x,x, y ) = E^yfr (°+ + Aa -) S ( & + + Afct -)* 1*°) 

U \ V 

s,t t v 

x (a+)"~ U A" (aiy (bl)^ A" (V ) 
^ r! s! i! 2.^1 ' \ q I \ r — q 

r,s,t q \ / \ 

x (^)"- t ( '_)'( 6 t)'- r+ «( 6 »_)'-«|* ) (6.95) 
Comparision with (6.90) allows one to infer 



9 

+</ 



(a + ) - (at)' ( 6 t + )" + ' (^-LJ (6.96) 
and, using the definition (6.88), one can write the right factor in (6.86) 

(J_) J - M (a^) (V + ) (6.97) 

(J-M)!(J + ^-^ 2 )!(J + ^-^)! 



x a 



(at) (^) (&L) |* ) 



g!( J + - i 2 - q)\(J - M - q)\(M + £ 2 ~h + ?)! 
,/+<?i-C 2 -<? / . \<j / . \M+l 2 -h+q / + \J-M-q 



Final Result 



Our last step is to apply the operator (K') 1+ 2 J to expression (6.97), to obtain the desired 
expansion of |J, M{li,l<i)) in terms of states |^i, mi)i|^2, "7,2)2- With given by (6.70) holds 



(tf) tl+e *- J = ( h + 1" " J ) ("1) S (6-98) 



all (6M (at) (6 t 



Operation of this operator on (6.97) yields, using the commutation property (6.50), 

|J,M(^ 2 )) = ^(-l) s (6.99) 
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(h + £ 2 - J)!( J - M)!( J + £ 1 - l 2 )!( J + i 2 - h)\ 

+ £ 2 - J - s)!g!( J + i x - £ 2 - q)\(J -M-q)\(M + e 2 -£ 1 + q)\ 

/ . s2i 1 -q-s / , \ g+s / . \ M+i 2 -h+q+s / . \ £i+t2—M—q—s 

(4) («'-) ( fet + ) i*o> 

The relationships (6.53,6.54) between creation operator monomials and angular momentum states 
allow one to write this 



x 



(*i + ^ - J)!(J - M)!(J + - £ 2 )!(J + ^2 - 4)! 



+ ^2 - J - s)\q\(J + £\ — £ 2 — q)\(J — M — q)\{M + £ 2 -£i+q)\ 
Xy/{2£ l -q-s)\{q + s)\ 

X^iM + £ 2 - h + q + sy.ih + £ 2 - M - q - s)l 
x\h,£i -q-s}! \£ 2 , M - £i + q + a) 2 

One can conclude that this expression reproduces (6.69) if one identifies 

mi = £ 1 - q - s , m 2 = M-£ l + q + s. (6.101) 

Note that mi + m 2 = M holds. The summation over q corresponds then to the summation 
over mi, m 2 in (6.69) since, according to (6.101), q = £\ — m\ — s and m 2 = M — m\. The 
Clebsch-Gordan coefficents are then finally 

(£i,mi,£ 2 ,m 2 \J,M) = 

zyy— \ £i +£2- jjm -£2 + j)K-h +£ 2 + jy. 

V + [ (£ 1 + £ 2 + J+l)l 

x [(h + mi)!(£i - rrn)\(£ 2 + m 2 )\(£ 2 - m 2 )!( J + M)!( J - M)p 
("l) s 



E 



X 

s\(£i — mi — s)\(£ 2 + m 2 — s)\ 

1 



(£1 + £ 2 — J — s)\(J -h-m 2 + s)\(J -£ 2 + mi + s)\ 



(6.102) 



The Normalization 



We want to determine now the expression (6.83) of the normalization constant N{£\ +£ 2 — J, £i,£ 2 ) 
defined through (6.82). For this purpose we introduce 

ji = \{J + £1 - £2) , 32 = g(J + £2 - £1) , n = h + £ 2 - J . (6.103) 

To determine N = N(£±+£ 2 — J, £±, £ 2 ) we consider the scalar product (J, J(£i,£ 2 )\J, J(£i,£ 2 )) = 1. 
Using (6.82) and (6.103) this can be written 

1 = N 2 ^{j u j 2 ,n)\^{j u j 2 ,n)) (6.104) 
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where 



\^{juh,n)) = (K^ |il, Jl>l|j2, J2>2 • 



(6.105) 



The first step of our calculation is the expansion of tp(ji, j2,n) in terms of states mi)i\j2, m 2 ) 2 - 
We employ the expression (6.57) for these states and the expression (6.70) for the operator . 
Accordingly, we obtain 



V(2ji)K2j 2 )! 

2ji 



2i2 



(at y (,t y |Vo> = 

(-1) V(2ji + n - s)!s!(2j 2 + s)!(n - s)! 



V(2ji)!(2j 2 )! 

2jl— 71— s 



E 



2J2+S 



s!(n — s)! 



l*o>. 



^{2j l +n-s)\s\{2j 2 + s)\{n-s)\ 
The orthonormality of the states occurring in the last expression allows one to write (6.104) 



(6.106) 



1 = N 2 



(n\Y 



(2ji)!(2j 2 ) 



E 



{2j l +n-s)\{2j 2 + s)\ 
s\(n — s)\ 



= (n!) 2 £ 

s 

The latter sum can be evaluated using 



2ji + n - s \ ( 2j 2 + s 



2ji 



2J2 



ni+l 



= E 



mi 



ni + mi 
ni 



,7711 



(6.107) 



(6.108) 



a property which follows from 



1 



QU 



d\ u \ 1 - A 



711 + 1 



(ni + 1/)! 



A=0 



ni! 



(6.109) 



and Taylor expansion of the left hand side of (6.108). One obtains then, applying (6.108) twice, 



1 



ni+l 



1 - 

which can be written 



1 



1 - A 



712 + 1 



E 



7711,7712 



771+712+2 



E 



1 - A 

Comparision with (6.108) yields the identity 

E 



E 



n\ + m\ 

m 



n\ + r — s 

m 



n 2 + m 2 
n 2 



n 2 + s 
n 2 



A mi 



+m 2 



A r 



n\ + r — s 

m 



n 2 + s 
n 2 



ni + n 2 + r + 1 
ni + n 2 + 1 



(6.110) 



(6.111) 



(6.112) 
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Applying this to (6.107) yields 



N 2 (n\y 



2ji + 2j 2 + n + 1 
2ji + 2j 2 + 1 



= 2 nl(2 jl + 2 J - 2 + n + l)! 

(2j 1 +2j 2 + 1)! v 7 

Using the identities (6.103) one obtains the desired result (6.83). 

6.4 Symmetries of the Clebsch-Gordan Coefficients 

The Clebsch-Gordan coefficients obey symmetry properties which reflect geometrical aspects of the 
operator relationship (6.11) 

1 = jW + jW . (6.114) 
For example, interchanging the operators J^> and results in 

1 = J {2) + J (1) • (6.115) 

This relationship is a trivial consequence of (6.114) as long as 1, jW, and are vectors in 
K^. For the quantum mechanical addition of angular momenta the Clebsch Gordan coefficients 
(£i , m\ , £ 2 , m.2 1 J, M) corresponding to (6.114) show a simple relationship to the Clebsch Gordan 
coefficients (£2,^2,^1,^1!^,^) corresponding to (6.115), namely, 

(£ 1 ,m 1 ,£ 2 ,m 2 \J,M) = (-1)^+^- J (£ 2 , m 2 , h, mi| J, M) . (6.116) 

If one takes the negatives of the operators in (6.114) one obtains 

-1 = -jW - jP) . (6.117) 

The respective Clebsch-Gordan coefficients (£\, —mi,£ 2 , — m\l\ J, — M) are again related in a simple 
manner to the coefficients (£i,mi,£ 2 ,m 2 \J, M) 

(e 1 ,m 1 ,£ 2 ,m 2 \J,M) = (-1)^ 1+ ^ 2_J (£1, —mi,£ 2 , —m 2 \J, —M) . (6.118) 

Finally, one can interchange also the operator J on the l.h.s. of (6.114) by, e.g., J^> on the r.h.s. 
of this equation 

= j(2) _ J _ 

The corresponding symmetry property of the Clebsch-Gordan coefficients is 



{£ u m u £ 2 ,m 2 \J,M) = \J (^2, -m 2 , J, M\l u mi) . (6.120) 

The symmetry properties (6.116), (6.118), and (6.120) can be readily derived from the expression 
(6.102) of the Clebsch-Gordan coefficients. We will demonstrate this now. 

To derive relationship (6.116) one expresses the Clebsch-Gordan coefficient on the r.h.s. of (6.116) 
through formula (6.102) by replacing (4,mi) by {£ 2 ,m 2 ) and, vice versa, (£ 2 ,m 2 ) by and 
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seeks then to relate the resulting expression to the original expression (6.102) to prove identity with 
the l.h.s. Inspecting (6.102) one recognizes that only the sum 

S^m^m^M) = £ sl{li _ mi ^l 2 + m2 _ s)l 

' (6.121) 



(£ 1 + £ 2 -J- s)\(J -£ 1 -m 2 + s)\(J -e 2 + mi + S y. 

is affected by the change of quantum numbers, the factor in front of S being symmetric in (£i, mi) 
and (£ 2 ,m 2 ). Correspondingly, (6.116) implies 

S(£ 1 ,m 1 ,£ 2 ,m 2 \J,M) = J S(£ 2 , m 2 ,h, mi \ J, M) . (6.122) 

To prove this we note that S on the r.h.s. reads, according to (6.121), 



s\{£ 2 — m 2 — s)\(£i + mi — s)\ 
1 



(6.123) 



(£i + £ 2 — J — s)!(J -£ 2 -mi + a)!( J - h + m 2 + s)l 
Introducing the new summation index 

s' = £i + £ 2 - J - s (6.124) 
and using the equivalent relationships 

s = £i + £ 2 - J - s' , -s = J £i - £ 2 + s' (6.125) 

to express s in terms of s' in (6.123) one obtains 
S(£ 2 ,m 2 ,£i,mi\J,M) = 



V ; V (h + £2~ J-s')\{J-£i-m 2 + s')\{J-£ 2 + mi + s')\ 



1 



s'l(£i - mi - s')l(£ 2 + m 2 - s')l 



(6.126) 



Now it holds that £i + £ 2 — J in (6.124) is an integer, irrespective of the individual quantum 
numbers £\, £ 2 , J being integer or half-integer. This fact can best be verified by showing that the 
construction of the eigenstates of {J^> + J^) 2 and (J^ + J^)z in Sect. 6.2 does, in fact, 
imply this property. Since also s in (6.102) and, hence, in (6.122) is an integer, one can state that 
s', as defined in (6.124), is an integer and, accordingly, that 

(-1)^' = (-1)*' (6.127) 

holds in (6.126). Reordering the factorials in (6.126) to agree with the ordering in (6.121) leads 
one to conclude the property (6.122) and, hence, one has proven (6.116). 
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To prove (6.118) we note that in the expression (6.102) for the Clebsch-Gordan coefficients the pref- 
actor of S, the latter defined in (6.121), is unaltered by the change mi, m 2 , M — > —mi, -m2, — M. 
Hence, (6.118) implies 

S(e 1 ,m 1 ,e 2 ,m 2 \J,M) = (-l) ei+i2 - J S(£ 1 ,-m 1 ,e 2 ,-m 1 2\J,-M) . (6.128) 

We note that according to (6.121) holds 

S(£ 1 ,-m 1 ,£ 2 ,-m 1 2\J,-M) = £ — — ^ 

s\(£\ + mi — s)\(£ 2 — m 2 — s)\ 

1 

(6.129) 



(£i + £ 2 — J - s)!(J - 4 + m 2 + s)!(J - 4 - mi + a)! ' 

Introducing the new summation index s' as defined in (6.124) and using the relationships (6.125) 
to replace, in (6.129), s by s' one obtains 

S(£i,—mi,£ 2 ,—m 2 \J,—M) = 

(_i)h+h-J sr^ (~ 1 )~ s 

V ; Y {h + £2-J-s')\{J-£ 2 + m l + s')\{J-h-m 2 + s')\ 

' (6.130) 



s'\(£ 2 + m 2 - s')\(£i - mi - s')\ 



For reasons stated already above, (6.127) holds and after reordering of the factorials in (6.130) to 
agree with those in (6.121) one can conclude (6.128) and, hence, (6.118). 

We want to prove finally the symmetry property (6.120). Following the strategy adopted in the 
proof of relationships (6.116) and (6.118) we note that in the expression (6.102) for the Clebsch- 
Gordan coefficients the prefactor of S, the latter defined in (6.121), is symmetric in the pairs of 
quantum numbers (^i,mi), (£ 2 ,m 2 ) and (J, M), except for the factor \JU + 1 which singles out 
J. However, in the relationship (6.120) this latter factor is already properly 'repaired' such that 
(6.120) implies 

S{£ u m u £ 2 ,m 2 \J,M) = (-1)^+"* S(£ 2 , -m 2 , J, M\£ u mi) . (6.131) 
According to (6.121) holds 

(-!)• 



S(£ 2 ,-m 2 , J,M|£i,mi) = 



s 

1 



s\{£ 2 + m 2 - s)\(J + M - s)\ 



(6.132) 



(£ 2 + J — £1 — a)!(4 — £ 2 — M + s)!(£i — J — m 2 + s)\ 
Introducing the new summation index 

s' = £ 2 + m 2 - s (6.133) 

and, using the equivalent relationships 

s = £ 2 + m 2 - s' , -s = -£ 2 - m 2 + s' (6.134) 
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to replace s by s' in (6.132), one obtains 

S(e 1 ,-m 1 ,e 2 ,-m 1 2\J,-M) = 



(_- [ \£2+ m 2 V" V 1 > 

V ; ^ {t2 + m 2 -si)\si\{J-l2 + m 1 + s>)\ 



X (J - h - m 2 + s')!(4 - mi - s') ! (^i + ^2 - J - s') ! ' ^" 13 ^ 

Again for the reasons stated above, (6.127) holds and after reordering of the factorials in (6.135) 
to agree with those in (6.121) one can conclude (6.131) and, hence, (6.120). 

6.5 Example: Spin— Orbital Angular Momentum States 

Relativistic quantum mechanics states that an electron moving in the Coulomb field of a nucleus 
experiences a coupling ~ J • S between its angular momentum, described by the operator J and 
wave functions Y£ m (r), and its spin-^, described by the operator S and wave function xi+i- As a 

z ' 2 2 

result, the eigenstates of the electron are given by the eigenstates of the total angular momentum- 
spin states 

yjm(£,h\r) = ^,m , ,i,a|j,m)y, m ,(f)xi (T ( 6 - 136 ) 

m' ,cr 

which have been defined in (6.18). The states are simultaneous eigenstates of (J^ tot ^) 2 , J^ tot \ J 2 , 
and S 2 and, as we show below, also of the spin-orbit coupling term ~ J ■ S. Here J^ tot ^ is defined 
as 

j(tot) = j> + g (6.137) 
Here we assume for S the same units as for J, namely, h, i.e., we define 

S = (6.138) 

rather than (5.223). 

Two-Dimensional Vector Representation 

One can consider the functions xi+i to be represented alternatively by the basis vectors of the 

2 2 

space 

H\ = ( o ) ■ " ( i ) ' < 6139) 

The states yj m (£, ||f), accordingly, can then also be expressed as two-dimensional vectors. Using 

m' = m - u ; a = ±± (6.140) 

one obtains 

y jm (i,m = ^,m-i,i,^\j,m)Y em _i(f) ( I 

+ (£,m+i,-i,i\j,m)Y tm+ i(f) ( ° ) (6.141) 
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or 



(£,m- m)Y tm _i(f) 
{£,m + |, — |, m)Y hn+ i[r) 



(6.142) 



In this expression the quantum numbers (£, m') of the angular momentum state are integers. Ac- 
cording to (6.140), m is then half-integer and so must be j. The triangle inequalities (6.220) state 
in the present case \£ — \\ < j < £ + \ an d, therefore, we conclude j = £ ± § or, equivalently, 
£ = j ± 1. The different Clebsch-Gordon coefficients in (6.141) have the values 



(j - 5> m - 5>3>3U> m ) 

(j - i, m + i, |, — 1| j,m) 
(j + ±,m- i,|,i|j,m) 
{j + i,m + i, |, -§| j,m) 



Ij+m 



2j 



m 



2i 

/ j — m + 1 
V 2j + 2 

/j + m + 1 
2j + 2 



(6.143) 
(6.144) 
(6.145) 
(6.146) 



which will be derived below (see pp. 170). Accordingly, the spin-orbital angular momentum states 
(6.141, 6.142) are 



yjm{j 5, ||^) 

Eigenvalues 

For the states (6.147, 6.148) holds 

(J + s) 3 y jm (jTh^) 



i + m Y i iff] 
i~ m Y. i iff) 

2j I j-±m+±\') 



-4 1 



-m+l 



Y j+hm _i(f) 



2j + 2 ± j+i 1 m- 2 

+ m+l y I- 
2j + 2 1 j+\m+\V 



(6.147) 
(6.148) 



S 2 y jm (jTh,h\r) 



n 2 j(j + i)y jr n(jTbh\r) 

hmy jm (j =F |, \\r) 

- A n 2 y jm {jTbh\r) 



Furthermore, using 
or, equivalently, 



(j(tot)j2 = (j + 5) 2 = j 2 + s 2 + 2J ■ S 
2J-S = (j( tot ^) 2 - J 2 - S 2 



(6.149) 
(6.150) 
(6.151) 

(6.152) 

(6.153) 
(6.154) 
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one can readily show that the states yj m (£, ||r) are also eigenstates of J ■ S. Employing (6.149, 
6.151, 6.152) one derives 



2J.Sy jm {j-\,\\r) 

= h 2 [jU + 1) - (i - §)(J + §)-!] y jm u -hW) 
= n 2 (j - \)y jr n{j-hW) 



and 



(6.155) 



2J-Sy 3m (j + hl\r) 

= h 2 [j(j + i) - 0" + i)0' + §) - 

= ^(-J-f)^ m (i + |,||r). 



}y jm (j + hh\r) 



(6.156) 



Orthonormality Properties 

The construction (6.141) in terms of Clebsch-Gordon coefficients produces normalized states. Since 
eigenstates of hermitean operators, i.e., of {J + S) 2 , {J + S)z, J 2 with different eigenvalues are 
orthogonal, one can conclude the orthonormality property 

/+7T /•27T 
smew / # [y*, m \t', \\e, <p) } T y* m (£, \\e, <t>) = s jjf s mml s u , (6.157) 
-7T JO 



where we have introduced the angular variables 9,<p to represent f and used the notation [■ • -] T 

Vj/ m i (£ , \ I 



to denote the transpose of the two-dimensional vectors y*/ m /(£', \\0, <f>) which defines the scalar 
product 



(a* b')( C )=a*c+ b*d. 



(6.158) 



The Operator a ■ f 

Another important property of the spin-orbital angular momentum states (6.147, 6.148) concerns 
the effect of the operator a ■ r on these states. In a representation defined by the states (6.139), 
this operator can be represented by a 2 x 2 matrix. 
We want to show that the operator a ■ r in the basis 



{(y jm u-hh\r),y jm u + ih\ 



assumes the block-diagonal form 



. ; m 



( -1 
-1 



-j, -i + 1, ...+j} 



(6.159) 



a ■ r = 



-1 
-1 



(6.160) 



V 



/ 
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where the blocks operate on two-dimensional subspaces spanned by {3^m(i — |,||^), 3^"m(j + 
§, §|r)}. We first demonstrate that a ■ r is block-diagonal. This property follows from the commu- 
tation relationships 

[J^ tot \a-r] = 0, k = 1, 2, 3 (6.161) 

where <7 fe *°* is defined in (6.137) To prove this we consider the case k = 1. For the l.h.s. of (6.161) 
holds, using (6.137), 

[Ji + Si, flXl + 02^2 + CJ3X3] 

= V2[Jl,X 2 ] + [Si, C7 2 ]x 2 + o- 3 [Ji,x 3 ] + [Si, C7 3 ]X 3 . (6.162) 

The commutation properties [cf. (5.53) for J\ and (5.228), (6.138) for a and Si] 

\J\,x 2 \ = -ih[x 2 d 3 - x 3 d 2 , x 2 ] = ihx 3 (6.163) 
[Ji,x 3 ] = -ih[x 2 d 3 - x 3 d 2 , x 3 ] = -ihx 2 (6.164) 

[Si, a 2 ] = ^[oi, <7 2 ] = ^<r 3 (6.165) 

[Si, 0-3] = ^[o"i, 0-3] = -^<t 2 (6.166) 
allow one then to evaluate the commutator (6.161) for k = 1 

[ l 7 1 >*' ) , _ ifr( a2X3 + _ _ - 2X3 ) = . (6.167) 

One can carry out this algebra in a similar way for the k = 2, 3 and, hence, prove (6.161). 
Since the differential operators in J^°^ do not contain derivatives with respect to r, the property 
(6.161) applies also to a ■ f, i.e., it holds 

[Ji tat \ d-r] = , k = 1, 23. (6.168) 



From this follows 



= h 2 j(j + l)a-fy jm (j±i,i\f), (6.169) 

i.e., a ■ ryj m (j ± §, ||f) is an eigenstate of (J'C *)) 2 with eigenvalue h 2 j(j + 1). One can prove 
similarly that this state is also an eigenstate of ^3 with eigenvalue hm. Since in the space 
spanned by the basis (6.159) only two states exist with such eigenvalues, namely, 3^m(j ± §, ||r), 
one can conclude 

o -ry jm {i + \,\\r) = a ++ (jm)y jm (j + §,±|r) + a + -(jm) y jm (j - §, \\r) (6.170) 
and, similarly, 

5 -ry jm {j - \,\\r) = a- + {jm)y jm {j+\,\\r) + a—(jm) y jm (j - ±, \\r) . (6.171) 
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We have denoted here that the expansion coefficients ct±±, in principle, depend on j and m. 

We want to demonstrate now that the coefficients a±±, actually, do not depend on m. This property 

follows from 

[jl tot \a-f] = (6.172) 

which is a consequence of (6.168) and the definition of J^ ot ^ [ c .f. (6.35)]. We will, hence, use the 
notation a±±(j) 



Exercise 6.5.1: Show that (6.172) implies that the coefficients a±± in (6.170, 6.171) are indepen- 
dent of m. 



We want to show now that the coeffients a ++ (j) and a (j) in (6.170, 6.171) vanish. For this 

purpose we consider the parity of the operator a ■ f and the parity of the states 3^' m (j ± §, §|r), 
i.e., their property to change only by a factor ±1 under spatial inversion. For a • r holds 

u • v — > a ■ (— r) = — a • f , (6.173) 

i.e., a ■ f has odd parity. Replacing the f-dependence by the corresponding (9, (^-dependence and 
noting the inversion symmetry of spherical harmonics [c.f. (5.166)] 

Yj+^iv-O'V + ti = i- l ) j+hY j+ \m±^A) (6-174) 
one can conclude for 3^ m (j + §, \ \f) as given by (6.142) 

yj m (j + h\\o,<j>) - y jm {j + h\\K-6,<K + <i>) = {-iy + h ]m {j + bk\0A). (6.175) 

Similarly follows for 3^ m (j — §, \ \?) 

y jm {j-h\\0,<j>) - (-ly-hjmU + hWA) ■ (6.176) 

We note that 3^m(i + §, \ and 3^m(j — |, \ \r) have opposite parity. Since a • f has odd parity, 
i.e., when applied to the states 3^ m (j ± |, ||f) changes their parity, we can conclude a++(j) = 

a (j) = 0. The operator a-f in the two-dimensional subspace spanned by yj m (j± ±, ||f) assumes 

then the form 

'■'-Uc,-, V)- 

Since <r - f must be a hermitean operator it must hold a = a* (j). 

According to (5.230) one obtains 

(<r-f) 2 = 11 . (6.178) 
This implies |a+_(j)| = 1 and, therefore, one can write 

B-r = ( J m e Q ) , (3(j) e R. (6.179) 



e 



One can demonstrate that a - f is, in fact, a real operator. For this purpose one considers the 
operation of a ■ r for the special case <p = 0. According to the expressions (6.147, 6.148) for 
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yjm(j ± \, ^\9,4>) and (5.174-5.177) one notes that for = the spin-angular momentum states 
are entirely real such that & ■ f must be real as well. One can conclude then 

*-r = ±( J j) (6.180) 

where the sign could depend on j. 

We want to demonstrate finally that the "—"-sign holds in (6.180). For this purpose we consider 
the application of u ■ f in the case of 9 = 0. According to (5.180) and (6.147, 6.148) the particular 
states 3^-i (j — §, \\r) and 3^i(j + |, at = are 



|,||^ = O,0) = VTF (6.181) 



Since c? • f , given by 



^•|(i + hk\e = o,<p) = |-V Q T#j. (6.182) 



a ■ r = (Jisin0cos0 + 02 sin sin + 0-3 cos 9, (6.183) 



in case 9 = becomes in the standard representation with respect to the spin-| states xi±l 
[c.f. (5.224)] 

^■r = ( J J ) , for 9 = (6.184) 

V / space xi,i 

2*2 

one can conclude from (6.181, 6.182) 

<?-^i(j-Ul# = O,0) = 1,11^ = 0,0) , for# = 0. (6.185) 

We have, hence, identified the sign of (6.180) and, therefore, have proven (6.160). The result can 
also be stated in the compact form 



a -ry jm (j ± 1, i|f) = -y jm (j T b \\r) (6.186) 



The Operator a ■ p 



The operator a-p plays an important role in relativistic quantum mechanics. We want to determine 
its action on the wave functions /(r)3^m(j ± ||^)- Noting that p = —ihV is a first order 
differential operator it holds 

<?-pf(r)y jm ti±hk\?) = {{a-~pf{r)))y 3m {j±b\\r) + f(r) a ■ i>y jm (j ± 1, i|f) (6.187) 

Here (( • • • )) denotes again confinement of the diffusion operator d r to within the double bracket. 
Since f(r) is independent of 9 and (j) follows 

((a-£/(r))) = -ih((d r f(r)))#-r. (6.188) 
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Using (6.186) for both terms in (6.187) one obtains 

a ■ pf(r)y jm (j ± ±,§|f) = ihd r f(r) - f(r)a-pa-r y jm (j T \, W) ■ 

The celebrated property of the Pauli matrices (5.230) allows one to express 

a ■ pa ■ r = p ■ r + ia • (px r) . 

For the test function h(r) holds 

' r 



(6.189) 
(6.190) 



p-rh 



-ihV ■ ( - h 
r 



-in— V • r — inr ■ V — 
r r 



3 1 

= —ih-h — ihhr-V- — ihr-Vh. 



Using V(l/r) = —f/r s and f • V/i = d r h one can conclude 



p-rh 



-ih 



+ d r ) h 



(6.191) 



(6.192) 



The operator px f in (6.190) can be related to the angular momentum operator. To demonstrate 
this we consider one of its cartision components, e.g., 



(p xf) 1 h 



-ih(d 2 — - d 3 — ) h 
r r 

-ih-(d2X 3 - <9 3 x 2 ) h - ihh(x 3 d2 X2d 3 -) . 

r r r 



(6.193) 



Using 82(1 /r) = —X2/r 3 , d 3 (l/r) = —x 3 /r^ and ^2X3 = 23(92, O3X2 = X2<93 we obtain 

(pxf)ih = -ih - (x 3 <9 2 - x 2 d 3 ) h = — - Ji h 
r r 



(6.194) 



where J\ is defined in (5.53). Corresponding results are obtained for the other components of p x f 
and, hence, we conclude the intuitively expected identity 



px f = - -J . 

r 

Altogether we obtain, using d r yj m (j ± i, ||f) = and a = 2S/H, 

2h 17-9 

d-pf{r)y jm {j±^\\r) = i[hd r + - + -^-]J(r)y jm {jThW) 
Using (6.155, 6.156) this yields finally 



(6.195) 



(6.196) 



d r + 



j + i 



f(r)y jm (j-hh\r) 



&-pg{r)yj m {j-\,\\r) = ih 



d r + 



9{r)yjm{j+hl\r) 



(6.197) 



(6.198) 
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To demonstrate the validity of this key result we note that according to (5.230, 5.99) holds 

(v.fr = _,2 V 2 = (6.199) 
We want to show that eqs. (6.197, 6.198), in fact, are consistent with this identity. We note 

^■f) 2 f(r)y jm (j + hh\r) 
= H3-p[idr + l -(j + l)]f(r)y jm (j-hh\r) 

= [Mr + ~ r {\ ~ J) }[iO r + + \) ] f{r) y jm (j + hh\r) 
= ^l-d 2 r -\d r + j Ai- il ~ J li J + l) ] f(r)y jm (j + hh\r) 
= h 2 [-d 2 r - 2 -d r + j2 + 2 J + h f(r)y jm (j + hh\r) (6-200) 
and, using (5.101), f + 2 j + f = (j + + §), as well as (6.151), i.e., 

n 2 {3 + \){3 + l)y jm {j + h\\r) = J 2 y jm (3 + h\\r) (6-201) 

yields 

(3-p) 2 f(r)y jm U + hh\r) = ("y^ + ^) f(r)y jm (j+hh\r) 
which agrees with (6.199). 

Evaluation of Relevant Clebsch-Gordan Coefficients 

We want to determine now the Clebsch-Gordan coefficients (6.143-6.146). For this purpose we use 
the construction method introduced in Sec. 6.2. We begin with the coefficients (6.143, 6.144) and, 
adopting the method in Sec. 6.2, consider first the case of the largest m-value m = j. In this case 
holds, according to (6.43), 

yjjti-hm = Y 3 _ h _ k {r) Xv _ . (6.202) 
The Clebsch-Gordan coefficients are then 

(j-h3-bbh\jJ) = 1 (6-203) 
(j-hJ + hb-h\jJ) = (6.204) 

(6.205) 

which agrees with the expressions (6.143, 6.144) for m = j. 
For m = j — 1 one can state, according to (6.45), 
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The corresponding Clebsch-Gordan coefficients are then 



ti-hJ-hhlljJ-l) = \ \A (6-207) 



2j 

= yf^ (6-208) 

(6.209) 



which again agrees with the expressions (6.143, 6.144) for m = j — 1. 

Expression (6.206), as described in Sec. 6.2, is obtained by applying the operator [c.f. (6.137)] 

j{tot) = j_ + s _ (6.210) 

to (6.202). The further Clebsch-Gordan coefficients (• • • \jj — 2), (• • • \jj — 3), etc., are obtained 
by iterating the application of (6.210). Let us verify then the expression (6.143, 6.144) for the 
Clebsch-Gordan coefficients by induction. (6.143, 6.144) implies for j = m 



VmU - 5. = Y^m-l X U + . (6.211) 

Applying J^ ot) to the l.h.s. and J_ + S_ to the r.h.s. [c.f. (6.210)] yields 
V U + m)(j-m+ 1) y jm -i(j - |,i|f) = 

/ j f f~¥~li 



j+m 



I J 2 y 

• \ •., vX/ ' " ,){J '" ]Y . - , 



.J m ._i Xl-i 



or 



^m-l(j - §,f|r) = 



j + m— 1 ^ r 

2j J 2 m 2 22 



0' - m )t L-/ ; — , in + W o,/, — , IN I ^-im-iXi-i 



2j(j-m + l) V2j(j-m+l) J ~2 m " 



2 2 2 



lj + m-1 j-m + 1 



2j ^2™- 2 2 2 W 2j 



2 2 2 2 



This implies 
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/ q — fn -4- 1 

(j-l,m-i,l,-i\j,m-l) = J J - (6.213) 

which is in agreement with (6.143, 6.144) for j = m — 1. We have, hence, proven (6.143, 6.144) 
by induction. 

The Clebsch-Gordan coefficients (6.146) can be obtained from (6.143) by applying the symmetry 
relationships (6.116, 6.120). The latter relationships applied together read 

{e 1 ,m 1 ,e 2 ,m 2 \e 3 ,m 3 ) = (-i)to+*s-ti+to+m2 x ( 6 .214) 

|^4jr[ ^,m 3 ,£ 2 , -m 2 \£i,m 1 ) 
For 



(j,m,i,i|i + i,m+i) = J 3 + .™ + 1 , (6.215) 



which follows from (6.143), the relationship (6.214) yields 



(j.m.i^\j + i.n, + ^ r- J^±l(j + i 2 . m + i 2 .i 2 . ..(;.„,) (0.210) 



and, using (6.215), one obtains 



(j + h m +hh ,- h \j,m) = J j + ™ + 1 . (6.217) 



Similarly, one can obtain expression (6.145) from (6.144). 



6.6 The 3j— Coefficients 

The Clebsch-Gordan coefficients describe the quantum mechanical equivalent of the addition of two 
classical angular momentum vectors and J7^g to obtain the total angular momentum vector 

JcUss = J§L + ^c\L- In this context jJj£L and J^ ss play the same role, leading quantum 
mechanically to a symmetry of the Clebsch-Gordan coefficients ( J M\£\m\l 2 m 2 ) with respect to 
exchange of l\m\ and £ 2 m 2 . However, a higher degree of symmetry is obtained if one rather 
considers classically to obtain a vector J^bbs^ with tlie P ro P ert y ^JllL + ^laL + ^ss°^ = °- 
Obviously, all three vectors J^ s , J^L and J^f' P la y equivalent roles. 

The coefficients which are the quantum mechanical equivalent to J7^ s + <7^bs + ^lass^ = ^ 
are the 3j-coefficients introduced by Wigner. They are related in a simple manner to the Clebsch- 
Gordan coefficients 

( 3 m\ m 2 m 3 ) = (-l) jW2+m3 (2 j3 + l)^(j 3 - m,\h m u j 2 m 2 ) (6.218) 



where we have replaced the quantum numbers J, M, £i, mi, £ 2 , m 2 by the set ji,mi,j 2 ,m 2 ,js,m3 
to reflect in the notation the symmetry of these quantities. 
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We first like to point out that conditions (6.21, 6.34) imply 



Jl 32 33 

mi f"2 f"3 

jl 32 33 

mi m 2 m 3 



if not mi + m 2 + 7713 = 



= if not \ji - j 2 \ < 33 < Jl + 32 



(6.219) 



(6.220) 



The latter condition \ ji — j 2 \ < j'3 < ji + j 2 , the so-called triangle condition, states that ji,j 2 ,j3 
form the sides of a triangle and the condition is symmetric in the three quantum numbers. 
According to the definition of the 3 j-coefficients one would expect symmetry properties with respect 
to exchange of ji , mi , j 2 , m 2 and 33 , 7713 and with respect to sign reversals of all three values 
777.1,777.2,7773, i.e. with respect to alltogether 12 symmetry operations. These symmetries follow the 
equations 



3i 32 33 

777i "T-2 7"3 

= (—iyi+h+33 



32 33 31 

7772 7773 ?77l 



( — iyi+32+33 



32 31 33 

7772 ?77l 7773 



31 

-mi 



32 

-m 2 



33 

-777 3 



(6.221) 



where the the results of a cyclic, anti-cyclic exchange and of a sign reversal are stated. In this way 
the values of 12 3j-coefficients are related. 

However, there exist even further symmetry properties, discovered by Regge, for which no known 
classical analogue exists. To represent the full symmetry one expresses the 3j-coefficients through 
a 3 x 3-matrix, the Regge-symbol, as follows 



3i 32 33 

nil "72 "1-3 



~jl + 32 + 33 jl - 32 + 33 jl + 32 - J3 
jl - mi 22 - m 2 j 3 - 7773 

jl + 7771 j 2 + 777 2 33 + m 3 



(6.222) 



The Regge symbol vanishes, except when all elements are non-negative integers and each row and 
column has the same integer value S = ji + j 2 + j'3. The Regge symbol also vanishes in case that 
two rows or columns are identical and S is an odd integer. The Regge symbol reflects a remarkable 
degree of symmetry of the related 3j-coefficients: One can exchange rows, one can exchange columns 
and one can reflect at the diagonal (transposition) . In case of a non-cyclic exchange of rows and 
columns the 3j-coefficent assumes a prefactor (— l) s . These symmetry operations relate altogether 
72 3j-coefficients. 

The reader may note that the entries of the Regge-symbol, e.g., —j\ + j 2 +33, are identical to the 
integer arguments which enter the analytical expression (6.102) of the Clebsch-Gordan coefficients, 
safe for the prefactor ^2^3 + 1 which is cancelled according to the definition (6.218) relating 3j- 
coefficients and Clebsch-Gordan coefficients. The two integer entries J — £1 — m 2 and J — £ 2 + 777 1 
in (6.102) are obtained each through the difference of two entries of the Regge-symbol. 
Because of its high degree of symmetry the Regge symbol is very suited for numerical evaluations 
of the 3j'-coefficents. For this purpose one can use the symmetry transformations to place the 
smallest element into the upper left corner of the Regge symbol. Assuming this placement the 
Regge symbol can be determined as follows (mi is the smallest element!) 



77n "12 "13 
"21 "22 "23 
"31 "32 "33 



("I) 



"12!"13!"2l!"3l! 



"11 



(S + l)!"ll!n22!"33!"23!"32D) n 



(6.223) 
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where 

S = nn + ni2 + ni 3 = ji + j 2 + j'3 (6.224) 
s = 7 " 23! m 7 " 32! m (6.225) 

u (n23 — nn)! (n32~n.11)! v ' 

S„ = - (nil+l-n)(n 22 +l-n)(n 3 3 + l-n) , g ^ 

" n(n23— "ii+n)(n32— nn+n) " 1 \ / 

We like to state finally a few explicit analytical expressions for Clebsch-Gordan coefficients which 
were actually obtained using (6.223-6.226) by means of a symbolic manipulation package (Mathe- 
matica) 

n .„ n . (-l) 1+m+mi <f(m, mi + m 2 ) ^(2 - mi) V(2 + m~i)\ 



10^(1 - m) V(l + "0 V( X - m 2 )Wi l + ^2)! 
1 > |m| A 1 > |m 2 | A 2 > |mi| (6.227) 



. f-l) m+mi (m + 2m 2 ) S(m,mi + m 2 )-i/(2 - mi)! (2 + mi)! 

(2m \lm\\m 2 ) = — . . . , 

^/6V ( 2 - m)! y/{2 + m) V(l - m 2 )! + m 2 )\ 

1 > |m 2 | A 2 > \m\ A 2 > |mi| (6.228) 



(-l) 2mi - 2m2 V7 <5(m, mi + m 2 ) ^(3 - m)\ ^(3 + m)! 

(3m2milm 2 J = — ; — . — . — . — . - 

VT05 ^(2 - mi)!V(2 + mi)! ^(1 - m 2 )! ^(1 + m 2 )! 

1 > |m 2 |) A 2 > |mi| A 3 > |m| 



(6.229) 



/ ,1 1 X 

(0m|-mi-m 2 ) 



2 ^ 



(6.230) 



„ .1 1 N (-l) 2mi_2mj V3 <5(m, mi + m 2 ) ^(1 - m) VU + m)! 

(lm|-mi-m 2 ) = / / / / 

2 2 v/67(| - mi)!^(i + mi )\yJ{\ - m 2 )!^(| + m 2 )! 

1 , 1 . , , 

2 > l m il A 2 - I™ 2 ' A 1 - l m ' (6.231) 

Here is an explicit value of a Clebsch-Gordan coefficient: 



(70± —15^|120 -1060| -h\) = 

4793185293503147294940209340 v / l27v / 142 
V35834261990081573635135027068718971996984731222241334046198355 

~ 0.10752786393409395427444450130056540562826159542886 (6.232) 
We also illustrate the numerical values of a sequence of 3j-coefficients in Figure 6.1. 
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3j to. 

10 1 20 
18 
16 
14 
12 
10 
8 
6 



-10 
-12 
-14 
-16 



Figure 6.1: Oscillatory behavior of 3j-coefficients. 



Irreducible Representation 



We had stated above that the spherical harmonics Yg m (£l)), eigenfunctions of the single particle 
angular momentum operators J 2 and J7" 3 , provide the irreducible representation for i.e. the 

rotations in single particle function space. Similarly, the 2-particle total angular momentum wave 
functions 2)jm(^i, ^l^ij ^2) provide the irreducible representation for the rotation 9t(i?) defined in 
(6.5), i.e. rotations in 2-particle function space. If we define the matrix representation of 9t(i9) by 
23(i?), then for a basis {Yi irni (Ui)Yi 2m2 (^l 2 ) , h, h = 1,2, . . . , mi = -£i, . . .+£1, m 2 = -£2, ■ ■ -+£2} 
the matrix has the blockdiagonal form 




60 -40 -20 20 40 60 



L-lxl-l 



1-3x1-3 



2>(<?) = 



{2h + 1) (2£i + 1) 
(2£ 2 + 1) x {2£ 2 + 1) 



(6.233) 



••• / 
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For the basis {2),/m(4,4|^i, Sh), 4,4 = 1,2, ... , J = |4 - 4|, 
each of the blocks in (6.233) is further block-diagonalized as follows 



,4 + 4, m 



-j,... j} 



(24 + 1) (24 + 1) 
(24 + 1) x {2£ 2 + 1) 



(2|4 - 


-£ 2 \ + l) 


x(2\£i 


-4| + 1) 



(2|4 - 


4|+3) 


x(2\£i- 


41 + 3) 



(6.234) 



(2(4 + £ 2 ) + 1) 
x(2(4+4) + l) 



Partitioning in smaller blocks is not possible. 



Exercise 6.6.1: Prove Eqs. (6.233,6.234) 

Exercise 6.6.2: How many overall singlet states can be constructed from four spin-^ states 
1 | § 777 '2)(2) 1 3 7Tl 3)(3) 1 5 rri 4)(4)^ > Construct these singlet states in terms of the product wave 
functions above. 

Exercise 6.6.3: Two triplet states |lmi)ndlm2)(2) are coupled to an overall singlet state 2)oo(l, !)• 
Show that the probability of detecting a triplet substate |lmi)(i) f° r arbitrary polarization (re- 
value) of the other triplet is | . 
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6.7 Tensor Operators and Wigner-Eckart Theorem 

In this Section we want to discuss operators which have the property that they impart angular 
momentum and spin properties onto a quantum state. Such operators T can be characterized 
through their behaviour under rotational transformations. 

Let T\ip) denote the state obtained after the operator T has been applied and let TZ(-d) denote 
a rotation in the representation of SO(3) or SU(2) which describes rotational transformations 
of the quantum states under consideration, e.g. the operator (5.42) in case (i) of the position 
representation of single particle wave functions or the operator (5.222) in case (ii) of single particle 
spin operators. Note that in the examples mentioned the operator T would be defined within the 
same representation as 7Z($). This implies, for example, that in 

case (i) T is an operator Coo(F) 
tiplicative operator T ip(f) 

case (ii) the operator T could be a spin operator Sk defined in (5.223), e.g. S 2 = Sf + S 2 + S 2 
or any other polynomial of S^. 

The operator T may also act on multi-particle states like Yg lJni (ri)Ye 2Tn2 ($2 )• In fact, some of the 
examples considered below involve tensor operators T of this type. 

Rotations transform \tp) as \tp') = 7£($)|V>) and T\ip) as 7£($)T|^j). The latter can be written 
T'\tp') where T' denotes T in the rotated frame given by 

T' = TZ0) T1Z~ 1 {$) . (6.235) 

The property that T imparts onto states \ip) angular momentum or spin corresponds to T behaving 
as an angular momentum or spin state multiplying \if)). The latter implies that T transforms like 
an angular momentum or spin state \£m), i.e. that T belongs to a family of operators {X^, q = 
—k, — k + 1, ... k} such that 

k 

T k = E V ?M T W- (6-236) 

q'=-k 

In this equation V^(3) denotes the rotation matrix 

Vf q 0) = {kq'\n0)\kq) . (6.237) 

The operators T G {T^q, q = —k, —k + l,...k} with the transformation property (6.236, 6.237) 
are called tensor operators of rank k. 

Examples of Tensor Operators 

The multiplicative operators Coo(^) — > Coo(F) 

Y lu (<) d ^ ^Y^f (6.238) 



— > Coo(i^) acting on single particle wave functions, e.g. a mul- 
= f(r)if)(r) or a differential operator Ttp(r) = + + 
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are tensor operators of rank k. Examples are 

1 



Y 00 
Yi±i 
Yio 

Y 2±2 

Y 2 ±i 
Y 20 



4"7T 





(xi ± ix 2 ) 




4ir 
1 flE 
4 V 2^ 
15 



rcosO 



X3 




r 2 sm 2 6 e ±2i * 



r 2 sin^ e ±i4 




1 

4 V 2vr 
15 

8tt 



(xi ± ix 2 ) 2 

(Xl ± iX 2 ) X3 




-cos 2 e 




r 2 
> x 3 



r 
2~ 



(6.239) 



These operators can be expressed in terms of the coordinates x±,X2,x 3 . The fact that these opera- 
tors form tensor operators of rank 1,2,3 follows from the transformation properties of the spherical 
harmonics derived in Section 1.3. 



Exercise 6.7.1: Show that the following set of spin operators 



Too 


= 1 


Ti±i 




T10 


= s 3 


T2±2 


= (5 ± ) 2 


?2±1 


= T(S 3 S ± + S ± S 3 ) 


T20 


= ^(3^1 - S 2 ) 



are tensor operators of rank 0, 1, 2. 

Exercise 6.7.2: Express the transformed versions of the following operators (a) T 
(b) S2S3 i n terms of Wigner rotation matrices and untransformed operators. 



x\ — x\ and 



For the following it is important to note that the rotation matrix elements (6.237) do not require 
that the rotation IZ^ 1 ^) is expressed in terms of Euler angles according to (5.203), but rather 
any rotation and any parametrization can be assumed. In fact, we will assume presently that the 
rotation is chosen as follows 



n{$) = exp(tf+L+ + + tf 3 L 3 



(6.240) 



where we have defined $± = \{$\ T ^2) and L± = L\ ± iL 2 . This choice of parametrization 
allows us to derive conditions which are equivalent to the property (6.235 - 6.237), but are far easier 
to ascertain for any particular operator. 



6.7: Tensor Operators 



179 



The conditions can be derived if we consider the property (6.235 - 6.237) for transformations 
characterized by infinitesimal values of i?_ , #3. We first consider a rotation with d = (1?+, 0, 0) T 
in case of small The property (6.235 - 6.237) yields first 

k 

Tt0) T kq TIT 1 0) = {kc(\n0)\kq)T kq , . (6.241) 

q'=~k 

Using 1Z0) = 1 + $+L+ + 0(i?+) this equation can be rewritten neglecting terms of order 0($^_) 

k 

(1 + # + L + )T kq (l - $ + L + ) = + $+L + \kq)T kq , . (6.242) 

q' = -k 

from which follows by means of (kq'\H\kq) = 5 qq i and by subtracting T kq on both sides of the 
equation 

k 

# + [L + ,T kq ] = #+(kq'\L+\kq)T kq , . (6.243) 

q' = -k 

From (5.80) follows (kq'\L + \kq) = —i^J{k + q + l)(/c — g)<5 9 ' q+ \ and, hence, 

[L+,T fc ,] = -iT kq+1 y/{k + q+l){k-q) . (6.244) 

Similar equations can be derived for infinitesimal rotations of the form $ = (0, 0) T , (0, 0, ??3) T . 
Expressing the results in terms of the angular momentum operators J+ , J_ , J3 yields 

[J+,T kq ] = hT kq+iy /(k + q + l)(k-q) (6.245) 
[J_,T fe(? ] = HT kq -i ^/ (k + q)(k — q + 1) (6.246) 
[J3,r fc ,] = hqT kq . (6.247) 

These properties often can be readily demonstrated for operators and the transformation properties 
(6.235 - 6.237) be assumed then. 



Exercise 6.7.3: Derive Eqs. (6.246, 6.247). 
Exercise 6.7.4: Is the 1-particle Hamiltonian 



H = -^V 2 + V(\r\) (6.248) 

_■ fib 

a tensor operator? 

Exercise 6.7.5: Consider a system of two spin-^ particles for which the first spin is described by 
the operator and the second spin by the operator S^ 2 \ Show that ■ is a tensor operator 
of rank in the space of the products of the corresponding spin states |^mi)(i)|^m,2)(2). For this 
purpose state first the proper rotation operator 1Z{jD) and note then that ■ commutes with 
the generators of the rotation of l^mi)^)!^?^)^)- 

Exercise 6.7.6: Form tensor operators of rank 1 in terms of the three components of V acting on 
the space of 1-particle wave functions. 
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6.8 Wigner-Eckart Theorem 

A second important property of tensor operators T kq beside (6.235 - 6.237) is that their matrix ele- 
ments {£\m\^\\T kq \£ 2 m 2 ^( 2 ) obey simple relationships expressed in terms of Clebsch-Gordan coeffi- 
cients. |^imi7i) denotes an angular momentum (spin) state, possibly the total angular momentum- 
spin state of a compositie system, which is characterized also by a set of other quantum numbers 71 
which are not affected by the rotational transformation lZ(-d). The relationships among the matrix 
elements {£imi~/i\T kq \£ 2 m 2 j 2 ) are stated by the Wigner-Eckart theorem which we will derive now. 
Starting point of the derivation is the fact that the states T kq \£ 2 m 2 -y 2 ) behave like angular momen- 
tum states of a composite system of two particles each carrying angular momentum or spin, i.e. 
behave like \kq)\£\mi) . To prove this we consider the transformation of T kq \ £ 2 m 2 ^ 2 ) 



110) T kq \£ 2 m 2l2 ) = 110) T kq ll' l 0) 110) \£ 2 m 2l2 ) 

>(<a) 

m' 2 m2 



£ IW2> (6-249) 



q' m 2 

which demonstrates, in fact, the stated property. One can, hence, construct states ^e imi (k,£2\j2) 
which correspond to total angular momentum states. These states according to (6.18) are defined 
through 

QtmikM-n) = ^2(hm 1 \kq£ 2 m 2 )T kq \e 2 m 2l2 ) . (6.250) 
q,mi 

We want to show now that these states are eigenstates of J 2 = J 2 + Jf + Jf and of J3 where 
Ji, J2, J3 are the generators of the rotation lZ(-d). Before we proceed we like to point out that the 
states |^2"i272) are also eigenstates of J 2 , J3, i.e. 

J 2 I £ 2 m 2 ~f 2 ) = h 2 £ 2 (£ 2 + l)\£ 2 m 2 j 2 ) ; J 3 | £ 2 m 2 j 2 ) = %m 2 \£ 2 m 2 ^ 2 ) . (6.251) 

The corresponding property for &e imi (k,£ 2 \"/ 2 ) can be shown readily as follows using (6.247), 
J3\£2m 2 "f 2 ) = hm 2 'j 2 \£ 2 m 2 ) and the property (6.21) of Clebsch-Gordan coefficients 

Jz^£ im A k ^2\l2) = ^2{£imi\kq£ 2 m 2 ) (J 3 T kq - T kq J 3 +T kq J 3 )\£ 2 m 2 -f 2 ) 

q ' m2 ^ 
= h (£imi\kq£ 2 m 2 ) (q + m 2 ) T kq \ £ 2 m 2 ~/ 2 } 

= hmi '^2(£imi\kq£ 2 m 2 )T kq \£ 2 m 2 j 2 ) (6.252) 

q,m 2 

Similarly, one can show that $£ im i(fc>^2|72) is an eigenstate of J 2 with eigenvalue f?£\(£\ + 1). The 
hermitian property of J3 and J 2 implies that states <&£ imi {k,£ 2 \^ 2 ) are orthogonal to l^m^) in case 
of different quantum numbers £i,mi, i.e. 

(£' 1 m' 1 \<f> eimi (k,£ 2 \72) = 06^5^^ (6.253) 



Exercise 6.8.1: Show that &£ imi (k,£ 2 \~f 2 ) is an eigenstate of J 2 with eigenvalue h 2 £i(£i + 1). 



6.8: Wigner-Eckart Theorem 
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In order to evaluate the matrix elements (£imiji\Tk q \£2'm2^2) we express using the equivalent of 
(6.32) 

T kq \£ 2 m 2 j2) = ^2(hm 1 \kq£ 2 m2)^i imi (khh2) (6.254) 

<imi 

and orthogonality property (6.253) 

(hmi-fi\T kq \£ 2 m2j2) = (hmiji\®e imi (k£2\j2)) (£imi\kq£2m 2 ) . (6.255) 

At this point the important property can be proven that (^lm-iTil^m^fc^^)) is independent 
of mi, i.e. the matrix elements (^imi7i \Tk q \£2m2l 2 ) can be reduced to an mi-independent factor, 
its mi-dependence being expressed solely through a Clebsch-Gordan coefficient. To prove this 
property we consider (-^imi7i|<I>£ imi (fc£ 2 |7 2 )) for a different mi value, say mi + 1. Using 

|*imi + l 7 i> = m ■ 1 == . J+|£im l7l ) (6.256) 

y/{£i + mi + l)(£i - mi) 

and noting that the operator adjoint to J + is J_, one obtains 

(4mi + l7i|^ imi+ i(H 2 |7 2 ) = 

(4mi7i|J_^ imi+ i(A;^2|72)) = 



—mi) 

(^imi7i|^ imi (H 2 |72)) (6.257) 

which establishes the mi-independence of (^imi7i|<£v imi (fc£ 2 |72))- I n order to express the mi- 
independence explicitly we adopt the following notation 

^im l7 i|^ imi (H 2 |72)) = {-l) k - M ^=^={£iM\Tk\\£2,li) ■ (6.258) 

We can then finally express the matrix elements of the tensor operators T kq as follows 

(^imi,7i|T fc? |£ 2 m 2 ,7 2 ) = 
(£imi\kq£2m 2 ) (-1)*-*>+<i_^= (4, 7i||7a# 2 , 7 2> (6.259) 

The socalled reduced matrix element 7 i| |T^| \£ 2 , 72) is determined by applying (6.259) to a 
combination of magnetic quantum numbers m^q' ,m' 2 , e.g. m[ = q' = m' 2 = 0, for which 
the l.h.s. can be evaluated as easily as possible. One can then evaluate also the corresponding 
Clebsch-Gordan coefficient ^im'^kq' £ 2 m' 2 ) and determine 



(£i,7i\\n\\£2,72) 



^ 1 + 1 (-l)^^(^m;|^ 2 m 2 ) (6 - 260) 



Exercise 6.8.2: Determine the matrix elements of the gradient operator V of the type 

J d 3 rF(r)VG(r) (6.261) 
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when the functions F(r) and G(r) are of the type f(r)Yg m (r). For this purpose relate V to a tensor 
operator Ti q , evaluate the matrix for m± = q = 771,2 = using 

cos6Y em (6,(t)) = 

/ (l+l-m)(l+l+m) y- (ft di\ A- I ^~ m W+ m ) Y , , ffl ^ 
V (2l+l)(27+3) 9) + y (2£-l)(2<?+l) ^-lm^ , <PJ 

sin0 r €m (0,0) = 

v /(2£+l)(2£+3) l+imy ,YJ v /2«-l)(2^+l) * imA 

and express the remaining matrix elements using the Wigner-Eckart theorem. (The necessary 
evaluations are cumbersome, but a very useful exercise!) 



Chapter 7 



Motion in Spherically Symmetric 
Potentials 



We describe in this section the stationary bound states of quantum mechanical particles in spher- 
ically symmetric potentials V(r), i.e., in potentials which are solely a function of r and are inde- 
pendent of the angles 9, <p. Four examples will be studied. The first potential 

_ f 0<r<R 
w [ oo r > R v ' 

confines a freely moving particle to a spherical box of radius R. The second potential is of the 
square well type 



The third potential 



V(r) = X -mu?r 2 (7.3) 



2 

describes an isotropic harmonic oscillator . The fourth potential 

Ze 2 

V(r) = --y- (7.4) 
governs the motion of electrons in hydrogen- type atoms. 

Potential (7.4) is by far the most relevant of the four choices. It leads to the stationary electronic 
states of the hydrogen atom [Z = 1). The corresponding wave functions serve basis functions for 
multi-electron systems in atoms, molecules, and crystals. The potential (7.3) describes the motion 
of a charge in a uniformely charged sphere and can be employed to describe the motion of protons 
and neutrons in atomic nuclei 1 . The potentials (7.1, 7.2) serve as schematic descriptions of quantum 
particles, for example, in case of the so-called bag model of hadronic matter. 



1 See, for example, Simple Models of Complex Nuclei / The Shell Model and the Interacting Boson Model by I. 
Talmi (Harwood Academic Publishers, Poststrasse 22, 7000 Chur, Switzerland, 1993) 
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7.1 Radial Schrodinger Equation 

A classical particle moving in a potential V(r) is governed by the Newtonian equation of motion 

mv = — e r d r V{r) . (7.5) 

In the case of an angular independent potential angular momentum J = mfxvisa constant of 
motion. In fact, the time variation of J can be written, using (7.5) and f = v, 

^jj-J = v x mv + f x mv = I fxe r (—d r V(r)) = . (7.6) 

Since J k is also equal to the poisson bracket {H, J k } where H is the Hamiltonian, one can conclude 

j k = {H,J k } = 0, k = 1,2,3. (7.7) 

The correspondence principle dictates then for the quantum mechanical Hamiltonian operator H 
and angular momentum operators J k 

[H,J k ] =0, k = 1,2,3. (7.8) 

This property can also be proven readily employing the expression of the kinetic energy operator 
[c.f. (5.99)] 

h 2 h 2 1 7 2 

__^V 2 = ~-d 2 T r + (7.9) 
2m 2m r 2mr z 

the expressions (5.85- 5.87) for J k as well as the commutation property (5.61) of J 2 and J k . Ac- 
cordingly, stationary states ^E,l,mif) can be chosen as simultaneous eigenstates of the Hamiltonian 
operator H as well as of J 2 and J%, i.e., 

Hil> E jt, m {f) = E^ EAm (f) (7.10) 
J 2 4>E/A?) = & 2 ^+l)^,m(r) (7.11) 
J3^E,e,m(r) = hmip E ,e,m{r) . (7.12) 

In classical mechanics one can exploit the conservation of angular momentum to reduce the equation 
of motion to an equation governing solely the radial coordinate of the particle. For this purpose 
one concludes first that the conservation of angular momentum J implies a motion of the particle 
confined to a plane. Employing in this plane the coordinates r, 6 for the distance from the origin 
and for the angular position, one can state 

J = mr 2 9 . (7.13) 

The expression of the kinetic energy 

n 2 1 1 1 T 2 

= -mr + -mr = -mr + ^ (7.14 

2m 2 2 2 2mr 2 v ' 

and conservation of energy yield 

\mr 2 + + V(r) = E. (7.15) 
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This is a differential equation which governs solely the radial coordinate. It can be solved by 
integration of 



± 



2 

m 



E - V(r) 



J 2 
2mr 2 

Once, r(t) is determined the angular motion follows from (7.13), i.e., by integration of 

i J 



mr 2 (t) 



(7.16) 



(7.17) 



In analogy to the classical description one can derivem, in the present case, for the wave function 
of a quantum mechanical particle a differential equation which governs solely the r-dependence. 
Employing the kinetic energy operator in the form (7.9) one can write the stationary Schrodinger 
equation (7.10), using (7.11), 



h 2 l _ 2 J 2 

o r + 

2m r r 2mr 2 



+ V(r) - E t ,r. 



ipE,e,m(r) = . 



Adopting for ipE,e,m(^) the functional form 

ipE,e,m(r) = v E ,e, m {r)Y em (9,< 



(7.18) 



(7.19) 



where Yg m (9, (f)) are the angular momentum eigenstates defined in Section 5.4, equations (7.11, 7.12) 
are obeyed and one obtains for (7.10) 



+ 1) 



h 2 1 2 h 2 

o r + — 

2m r r 2mr 2 



+ V(r) - E, 



VE,e,m{r) 



. 



(7.20) 



Since this equation is independent of the quantum number m we drop the index m on the radial 
wave function VE t e : m(r) and E^ m . 

One can write (7.20) in the form of the one-dimensional Schrodinger equation 



2m 



d 2 r + V cS (r) - E 



<Pe{t) 







where 



V eS (r) 
<f>E(r) 



V(r) + 



h 2 t(i + 1) 



2mr 2 



r v E ,e,m(r) • 



(7.21) 



(7.22) 
(7.23) 



This demonstrates that the function rvE,e, m (r) describes the radial motion as a one-dimensional 
motion in the interval [0, oof governed by the effective potential (7.22) which is the original potential 
V(r) with an added rotational barrier potential ^mr^~- ^ n ^ s barrier, together with the original 
potential, can exclude particles from the space with small r values, but can also trap particles in 
the latter space giving rise to strong scattering resonances (see Section ??). 
Multiplying (7.20) by —2mr/h 2 yields the so-called radial Schrddinger equation 



9 2 r ~ - U(r) - nj 



rv K Ar) = . 



(7.24) 
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where we defined 

2 777 

U(r) = - W V(r) (7.25) 
4 = -^E e (7.26) 
In case E < 0, n assumes real values. We replaced in (7.20) the index E by the equivalent index 

K. 

Boundary Conditions 

In order to solve (7.20) one needs to specify proper boundary conditions 2 . For r — ► one may 
assume that the term 1(1 + l)/r 2 becomes larger than the potential U(r). In this case the solution 
is governed my 

d r ~ rv K 4r) = 0, r -+ (7.27) 

and, accordingly, assumes the general form 

rv K/ (r) ~ Ar i+1 + Br' 1 (7.28) 

or 

v K: e(r) ~ At 1 + Sr^ 1 (7.29) 

Only the first term is admissable. This follows for I > from consideration of the integral which 
measures the total particle density. The radial part of this integral is 

/•oo 

/ drr 2 vl £ (r) (7.30) 
J o 



and, hence, the term Br would contribute 



I£| 2 



f £ drr- 2e (7.31) 
JO 



which, for £ > is not integrable. For £ = the contribution of 2?r to the complete wave 

function is, using the expression (5.182) for Yqq, 

i^EAmir) ~ "iq • (7.32) 

The total kinetic energy resulting from this contribution is, according to a well-known result in 
Classical Electromagnetism 3 , 

^V 2 fe m (r1 ~ y/A^ B 5(x 1 )5(x 2 )5(x 3 ) . (7.33) 



2 A detailed discussion of the proper boundary conditions, in particular, at r = is found in the excellent 
monographs Quantum Mechanics I, II by A. Galindo and P. Pascual (Springer, Berlin, 1990) 

3 We refer here to the fact that the function <E>(r) = \jr is the solution of the Poisson equation V 2 = 
—4n5(x)8(y)S(z); see, for example, "Classical Electrodynamics, 2nd Ed." by J.D. Jackson (John Wiley, New York, 
1975). 
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Since there is no term in the stationary Schrodinger equation which could compensate this 5- 
function contribution we need to postulate that the second term in (7.29) is not permissible. One 
can conclude that the solution of the radial Schrodinger equation must obey 

rv K/ (r) -► for r -> . (7.34) 

The boundary conditions for r — ► oo are governed by two terms in the radial Schrodinger equation, 
namely, 

[d? — k|] rv K) i(r) = for r — ► oo . (7.35) 

We have assumed here linv—xx, V{r) = which is the convention for potentials. The solution of 
this equation is 

rv K/ {r) ~ Ae~ Kr + B e +Kr for r -> oo . (7.36) 

For bound states k is real and, hence, the second contribution is not permissible. We conclude, 
therefore, that the asymptotic boundary condition for the solution of the radial Schrodinger equa- 
tion (7.20) is 

rv Kj e(r) ~ e~ Kr for r -> oo . (7.37) 



Degeneracy of Energy Eigenvalues 

We have noted above that the differential operator appearing on the l.h.s. of the in radial 
Schrodinger equation (7.24) is independent of the angular momentum quantum number m. This 
implies that the energy eigenvalues associated with stationary bound states of radially symmetric 
potentials with identical £, but different m quantum number, assume the same values. This be- 
haviour is associated with the fact that any rotational transformation of a stationary state leaves 
the energy of a stationary state unaltered. This property holds since (7.8) implies 

[H, exp(-^-J)] = . (7.38) 

Applying the rotational transformation exp(— j=i? • J) to (7.10) yields then 

H exp(-^ • J) ^ EAm (r) = E exp(-^ • J) ^ EAm (r) , (7.39) 

i.e., any rotational transformation produces energetically degenerate stationary states. One might 
also apply the operators J± = J\ ± ifo to (7.10) and obtain for — £ < m < £ 

HJ±^EAm{^} = EJ±i; EAm (r), (7.40) 

which, together with the identities (5.172, 5.173), yields 

HrpEAm±i(i f ) = E^ EAm±1 (r) (7.41) 

where E is the same eigenvalue as in (7.40). One expect, therefore, that the stationary states for 
spherically symmetric potentials form groups of 2£ + 1 energetically degenerate states, so-called 
multiplets where £ = 0,1,2,.... Following a convention from atomic spectroscopy, one refers to 
the multiplets with £ = 0, 1,2,3 as the s, p, d, /-multipltes, respectively. 

In the remainder of this section we will solve the radial Schrodinger equation (7.20) for the potentials 
stated in (7.1-7.4). We seek to describe bound states for the particles, i.e., states with E < 0. 
States with E > 0, which play a key role in scattering processes, will be described in Section ??. 
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7.2 Free Particle Described in Spherical Coordinates 

We consider first the case of a particle moving in a force-free space described by the potential 

V(r) = . (7.42) 
The stationary Schrodinger equation for this potential reads 

h 2 



2m 



E 







(7.43) 



Stationary States Expressed in Cartesian Coordinates The general solution of (7.43), as 
expressed in (3.74-3.77), is 

i)(k\r) = Ne it? (7.44) 

where 

h 2 k 2 

E = OLl. > . (7.45) 
2m ~ y J 

The possible energies can assume continuous values. N in (7.44) is some suitably chosen normal- 
ization constant; the reader should be aware that (7.44) does not represent a localized particle and 
that the function is not square integrable. One chooses N such that the orthonormality property 

f d 3 r ^*(k'\r)^(k\r) = 5{k' - k) (7.46) 
holds. The proper normalization constant is N = (27r)~ 3 / 2 . 

In case of a force-free motion momentum is conserved. In fact, the Hamiltonian in the present case 

H = -^V 2 (7.47) 
2m 

commutes with the momentum operator p = (/i/i)V and, accordingly, the eigenfunctions of (7.45) 
can be chosen as simultaneous eigenfunction of the momentum operator. In fact, it holds 

pNj % * = HkNe**. (7.48) 
as one can derive using in (7.48) Cartesian coordinates, i.e., 



k ■ f = kixi + k 2 x 2 + k 3 x 3 . (7.49) 




Stationary States Expressed in Spherical Coordinates Rather than specifying energy 
through k = \k\ and the direction of the momentum through k = k/\k\ one can exploit the 
fact that the angular momentum operators J 2 and J$ given in (5.97) and in (5.92), respectively, 
commute with H Q as defined in (7.47). This latter property follows from (5.100) and (5.61). Ac- 
cordingly, one can choose stationary states of the free particle which are eigenfunctions of (7.47) as 
well as eigenfunctions of J 2 and J% described in Sect. 5.4. 



7.2: Free Particle 
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The corresponding stationary states, i.e., solutions of (7.43) are given by wave functions of the form 

4>(k,£,m\r) = v kie (r)Y em (e,4>) (7.50) 
where the radial wave functions obeys [c.f. (7.20)] 



h 2 1 o2 h 2 £(£ + 1) 

d 2 r r + , V ; - E e , 



Vk,e(r) = . 



2m r 7 2mr 2 

Using (7.45) and multiplying (7.20) by —2mr/h 2 yields the radial Schrodinger equation 



d 2 - 



£{£ + l) 



k 2 



rv k Ar) 



. 



(7.51) 



(7.52) 



We want to determine now the solutions Vk/(r) of this equation. 

We first notice that the solution of (7.52) is actually only a function of kr, i.e., one can write 
Vk,^) = jt(kr). In fact, one can readily show, introducing the new variable z = kr, that (7.52) 
is equivalent to 

1) 



dz 2 



1 



zje(z) 



. 



(7.53) 



According to the discussion in Sect. 7.1 the regular solution of this equation, at small r, behaves 
like 

j t (z) ~ z e for r -► . (7.54) 
There exists also a so-called irregular solution of (7.53), denoted by nt(z) which behaves like 

n t (z) ~ z'^ 1 for r -> . (7.55) 

We will discuss further below also this solution, which near r = is inadmissable in a quantum 
mechanical wave function, but admissable for r / 0. 
For large z values the solution of (7.53) is governed by 



dz 2 



+ 1 



the general solution of which is 



z 3i( z ) =0 for r — > oo 



sin(z + a) for r — > oo 



(7.56) 



(7.57) 



for some phase a. 

We note in passing that the functions ge(z) = ji{z),n^{z) obey the differential equation equivalent 
to (7.53) 

d 2 2d £{£ + !) 



dz 2 ^ z dz 



+ 1 



ge(z) = 



(7.58) 



Noting that sm(z + a) can be written as an infinite power series in z we attempt to express the 
solution of (7.53) for arbitary z values in the form 



j t (z) = /f(z 2 ), f{z 2 ) = J2 a nZ 2n 



(7.59) 



n=0 
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The unknown expansion coefficients can be obtained by inserting this series into (7.53). We have 
introduced here the assumption that the factor / in (7.59) depends on z 2 . This follows from 



from which we can conclude 



(- 

\dz 2 



2 d 

z dz 



. 



Introducing the new variable v = zi yields, using 



±JL - 2— — - 4v— 2 — 

z dz dv ' dz 2 dv 2 dv ' 



the differential equation 



( d 2 2£ + 3 d i 
\drf 



+ 



2v dv 4v 



f(v) 



(7.60) 



(7.61) 



(7.62) 



(7.63) 



which is consistent with the functional form in (7.59). The coefficients in the series expansion of 
f(z 2 ) can be obtained from inserting Y^=o a nZ 2n into (7.63) (v = z 2 ) 



£ (a n n(n-l)v n ~ 2 + l - (2£ + 3) a n v n ~ 2 + -^v^A - 

n n / 



2 v 4 
Changing the summation indices for the first two terms in the sum yields 



(a n+1 n(n-l) + X - (21 + 3) a n + ^ a„ J v 11 ' 1 = 0. 



(7.64) 



(7.65) 



In this expression each term ~ v n 1 must vanish individually and, hence, 



a n +i 



One can readily derive 



1 



1 



a i 



2 1! {21 + 3) 
The common factor ao is arbitrary. Choosing 



2 (n+ 1) (2n + 2^ + 3) 



O0 , «2 



4 2!(2£ + 3)(2£ + 5) 



a 



1 



a 



1-3-5 -(2t + l) 



the ensuing functions (£ = 0, 1, 2, . . .) 



k{z) 



1 -3-5- • -(2^ + 1) 



±z 2 



+ 



ih 2 ) 2 



l!(2^ + 3) 2!(2£ + 3)(2^ + 5) 



+ 



(7.66) 



(7.67) 



(7.68) 



(7.69) 



are called regular spherical Bessel functions. 
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One can derive similarly for the solution (7.55) the series expansion {£ = 0, 1,2, . . .) 



ni(z) = - 



1 ■ 3 ■ 5 ■ ■ ■ (2£ - 1) 



1 ~2 



1 - 



l\(l-2£) 2!(l-2£)(3-2*) 
These functions are called irregular spherical Bessel functions. 



+ ■ 



(7.70) 



Exercise 7.2.1: Demonstrate that (7.70) is a solution of (7.52) obeying (7.55). 



The Bessel functions (7.69, 7.70) can be expressed through an infinite sum which we want to specify 
now. For this purpose we write (7.69) 



U{z) 



\ 2 ) 1.3. 5. ..(*+!) 



2 2 2 



((f) 2 , 



mr 



+ 



The factorial-type products 



1 3 

2 ' 2 



' + 



can be expressed through the so-called Gamma-function 4 defined through 



f c 

T(z) = / 
J o 



This function has the following properties 5 

T(z+1) 
r(n+l) 

r(|) 

r^)r(i-z) 

from which one can deduce readily 

T(£+l) = y^. 

One can write then 



zT{z) 

n\ for n 6 N 



7T 
7T 



smirz 



1 3 

2 ' 2 



k{z) 



E 



(I) 



2n 



^n!T(n + l+£ + 



i- 



(7.71) 



(7.72) 



(7.73) 



(7.74) 
(7.75) 
(7.76) 

(7.77) 



(7.78) 



(7.79) 



4 For further details see Handbook of Mathematical Functions by M. Abramowitz and LA. Stegun (Dover Publica- 
tions, New York) 

5 The proof of (7.74-7.76) is elementary; a derivation of (7.77) can be found in Special Functions of Mathematical 
Physics by A.F. Nikiforov and V.B. Uvarov, Birkhauser, Boston, 1988) 
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Similarly, one can express ni{z) as given in (7.70) 



1 + 



(¥)' 



+ 



(f) / 



l\{\-£) 2!(§ -£)(!-£) 



+ 



Using (7.77) for z = £ + \, i.e., 



yields 



or 



nt+h) = (-i) 



7T 



n e (z) 



(-l) £+1 v^ 



yH+l 



ra-i) 



+ 



+ 



i+l oo 



i!r(i 



1 ^ 1 ^ 2 W ^ n!r(n + l-^-i 



(7.80) 



(7.81) 



(7.82) 
(7.83) 



Linear independence of the Regular and Irregular spherical Bessel Functions 

We want to demonstrate now that the solutions (7.69) and (7.69) of (7.53) are linearly independent. 
For this purpose we need to demonstrate that the Wronskian 



does not vanish. Let /i, f 2 be solutions of (7.53), or equivalently, of (7.58). Using 



dz 2 



7*1,2 



2 -/l2 + 

z dz z 



+ 1) f f 
- 2 — /l,2 - 71,2 



one can demonstrate the identity 



j- z W{hJ2) = -\w{h,f 2 ) 



This equation is equivalent to 



the solution of which is 



dz 



mW 



In W 



dz z 2 



c 

^2 



(7.84) 



(7.85) 



(7.86) 



(7.87) 



(7.88) 



for some constant c. For the case of f\ = je and f 2 = ng this constant can be determined using 
the expansions (7.69, 7.70) keeping only the leading terms. One obtains c = 1 and, hence, 

1 



W(je,n e ) 



?2 • 



(7.89) 



The Wronskian (7.89) doesn not vanish and, therefore, the regular and irregular Bessel functions 
are linearly independent. 
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Relationship to Bessel functions 

The differential equation (7.58) for the spherical Bessel functions ge(z) can be simplified by seeking 
the corresponding equation for G i+ i(z) defined through 



Using 



9t(z) = ^=G e+ i(z). 



-7-91(2) = —. = — G,,i(z) — - — 7 =Gf,i(z) 

d 2 , , Id 2 Id 3 

" 7zdz^ G ^ {z) - z^Tzdz G ^ Z) + Az^Tz G ^ {z) 



dz^ 



(7.90) 



(7.91) 



(7.92) 



one is lead to Bessel's equation 



£_ ld_ _ ^ ! 

dz 2 z dz z 2 



G v (z) = (7.93) 



where v = i + \. The regular solution of this equation is called the regular Bessel function. Its 
power expansion, using the conventional normalization, is given by [c.f. (7.79)] 



n=0 



One can show that J-„(z), defined through (7.94), is also a solution of (7.93). This follows from 
the fact that v appears in (7.93) only in the form u 2 . In the present case we consider solely the 
case v = £ + \. In this case J- V (z) is linearly independent of J v (z) since the Wronskian 

W(J t+ i, J_ e _i) = (-if— (7.95) 

is non- vanishing. One can relate J £+ i and J_^_i to the regular and irregular spherical Bessel 
functions. Comparision with (7.79) and (7.83) shows 



U(z) 



n t (z) = (-l)* +1 ^J_,_i(z). (7.97) 

These relationships are employed in case that since numerical algorithms provide the Bessel func- 
tions J u (z), but not directly the spherical Bessel functions je(z) and nt(z). 



Exercise 7.2.2: Demonstrate that expansion (7.94) is indeed a regular solution of (7.93). Adopt 
the procedures employed for the function je(z). 
Exercise 7.2.3: Prove the identity (7.95). 
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Generating Function of Spherical Bessel Functions 

The stationary Schrodinger equation of free particles (7.43) has two solutions, namely, one given 
by (7.44, 7.45) and one given by (7.50). One can expand the former solution in terms of solutions 
(7.50). For example, in case of a free particle moving along the a^-axis one expands 

e ik " X3 = a ^ h(kr) Y £m (9, 0) . (7.98) 

The l.h.s. can be written exp(i£x cos 9), i.e., the wave function does not depend on 0. In this case 
the expansion on the r.h.s. of (7.98) does not involve any non-vanishing m-values since Y£ m (9,(f)) 
for non- vanishing m has a non-trivial ^-dependence as described by (5.106). Since the spherical 
harmonics Yio(9,(f>), according to (5.178) are given in terms of Legendre polynomials i-£(cos#) one 
can replace the expansion in (7.98) by 

oo 

e lk3X3 = J2beh(kr)P e (cos9) . (7.99) 
e=o 

We want to determine the expansion coefficients &£. 
The orthogonality properties (5.179) yield from (7.99) 



/_ 



+1 dcos9e lkrcose P e (cos9) = b e j e (kr) . (7.100) 



Defining x = cos 9, z = kr, and using the Rodrigues formula for Legendre polynomials (5.150) 
one obtains 

f' dx e iz * P e (x) = j + ' dx e izx ± (* 2 - if . (7.101) 
Integration by parts yields 



J dx e lzx P e (x) 



1 



dx l ~ 



+i 



-l 



(7.102) 



2 l i\ J_ x \dx J dx 1 - 1 v ' 



One can show 

t _ x {x 2 - if ~ (x 2 - 1) x polynomial in x (7.103) 

and, hence, the surface term ~ [■ • ■]!]; vanishes. This holds for i consecutive integrations by part 
and one can conclude 

u dx elzx Peix) = w L dx {x2 ~ 1)1 &lzx 

= ^ f\x (1 - x'fe^ . (7.104) 
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Comparision with (7.100) gives 



biJi(kr) =M.J^dx(l- x 2 ) e e iz * . (7.105) 

This expression allows one to determine the expansion coefficients b(_. The identity (7.105) must 
hold for all powers of z, in particular, for the leading power x [c.f. (7.69)] 

z l 2 {izf f +1 



t l-3-5---(2£+l) 21 + 1 ' 2 l £\ 
Employing (5.117) one can write the r.h.s. 



J dx (1 - x 2 Y ■ (7.106) 



Z 1 2H\ [l-3-5---(2*-l)] 2 2^ + 1 (7 " 107) 



or 



t'W+V* 1 1.3. 5- ■■(2^ + 1) 2£+l 1.2-3-4---(2€-l)-2^ (7 " 108) 
where the last factor is equal to unity. Comparision with the l.h.s. of (7.106) yields finally 

b e = i e (2£+l) (7.109) 

or, after insertion into (7.99), 

00 

e ikrcose = {2£+l) j e {kr) P e {cos8) . (7.110) 
e=o 

One refers to the l.h.s. as the generating function of the spherical Bessel functions. 
Integral Representation of Bessel Functions 

Combining (7.105) and (7.109) results in the integral representation of je(z) 

U{z) = -fe^ ! +1 dx{l-x 2 Ye^ . (7.111) 



2 e +H\ 7_i 

Employing (7.96) one can express this, using v = £ + |, 

M.) = ^^ Y) (I)' £' * (1 - xV-i . (7.112, 
We want to consider the expression 

G v {z) = a„z v f v {z) (7.113) 

where we define 

f v (z) = [ dt (1 - i 2 )^! e iz * . (7.114) 
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Here C is an integration path in the complex plane with endpoints t±, ti- G v {z), for properly 
chosen endpoints t±,t2 of the integration paths C, obeys Bessel's equation (7.93) for arbitrary u. 
To prove this we note 



; / dt(l - t 2 ) v -\ te izt . 
Ic 



Integration by part yields 

— \ (1 - t 2 ) u+ ^ e izt 

2v + l L v ' 



2v + l 



c 



dt (1 - t 2 ) u+ -2 e izt 



In case that the endpoints of the integration path C satisfy 

*2 



{l-t 2 ) u+1 2 e izt 



ti 



one can write (7.116) 

m = 



^ + 1 Jc 



1 - t 2 ) u -^e lzt + / dt(l - t 2 ) v ~^ t 2 e izt 
c 



or 



From this we can conclude 



f"(z) + 2 ^/^) + fu(z) = 0. 
We note that equations (7.114, 7.120) imply also the property 

(2* +!)#(*) + zf u+1 {z) = 0. 



(7.115) 
(7.116) 

(7.117) 

(7.118) 
(7.119) 

(7.120) 
(7.121) 



Exercise 7.2.4: Prove (7.121). 

We can now demonstrate that G u {z) defined in (7.113) obeys the Bessel equation (7.93) as long as 
the integration path in (7.113) satisfies (7.117). In fact, it holds for the derivatives of G v {z) 

G' v {z) = - z a v z v f u {z) + a v z v f' v {z) (7.122) 

G"(z) = V -^pla v z»f v {z) + ^a^fUz) + f'J(z) 

(7.123) 

Insertion of these identities into Bessel's equation leads to a differential equation for f u (z) which is 
identical to (7.120) such that we can conclude that G v (z) for proper integration paths is a solution 
of (7.93). 

We consider now the functions 

= — - ( Z -Y I dx(l- * 2 r§ e*" , j = 1,2,3,4. (7.124) 
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for the four integration paths in the complex plane parametrized as follows through a real path 
length s 



Ci(ti = -1 - t 2 = +1) : 
C 2 (ti = 1 -► *2 = 1 + *oo) : 
C^ii = 1 + i oo — > i 2 = — 1 + i oo) : 

C 4 (ti = -1 + ioo -> ti = -1) : i 



t = s 



-1 < s < 1 



i = 1 + is < s < oo 
t = 1 + is -1 < s < 1 
— 1 + is < s < oo 



(7.125) 
(7.126) 
(7.127) 



(7.128) 

C = Ci U C2 U C3 U C4 is a closed path. Since the integrand in (7.124) is analytical in the part of 
the complex plane surrounded by the path C we conclude 

4 

J2 uU) (z) = (7.129) 
i=i 

The integrand in (7.124) vanishes along the whole path C3 and, therefore, u^'(z) = 0. Comparision 
with (7.112) shows u^\z) = J v (z). Accordingly, one can state 



Mz) 



u (2)( z ) + U W( Z ) 



(4), 



(7.130) 



We note that the endpoints of the integration paths C2 and C4, for Hez > and v £ M obey 
(7.117) and, hence, -u^^z) and u^ 2 ^(z) are both solutions of Bessel's equation (7.93). 
Following convention, we introduce the so-called Hankel functions 

HW(z) = -2u (2 \z) , HP(z) = -2u^(z). 

According to (7.130) holds 



J v (z) = \ [HW(z) + HP(z) 



(7.131) 
(7.132) 



For H ( J\z) one derives, using t = 1 + is, dt = ids, and 

{i-t 2 y~h izt = [(l-t)(l + t)]"-h izt 
= [-is(2 + is)}"-^e tz e- zs 
= e H*-K»/2+*/4)2>'-h [s(l + is/2)] u -^ e- zs , 



the integral expression 

HW(z) = e ^-W2-^/4) f zU r ds[s{l + is/2)Y^ 
Similarly, one can derive 



H<?Xz) = e -i(*-™/2-*/*) ^ ds [s(l - i a /2)]"-3 e 

(7.134) and (7.135) are known as the Poisson integrals of the Bessel functions. 



(7.133) 
(7.134) 

(7.135) 
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Asymptotic Behaviour of Bessel Functions 



We want to obtain now expansions of the Hankel functions H^f(z) in terms of z 1 such that 

the expansions converge fast asymptotically, i.e., converge fast for \z\ — > oo. We employ for this 
purpose the Poisson integrals (7.134) and (7.135) which read for v = I + \ 

where 



H^(z) = e^-^m^t_jY\z) (7.136) 



$\z) = ds[s(s ± is/2)] e e- zs . (7.137) 
Jo 

The binomial formula yields 



The formula for the Laplace transform of s n leads to 



H(^-r)! V 2/ Vz7 

r=0 v / \ / \ / 



and, hence, we obtain 



= V^e^-^fl f 4^ f±±Y (7.140) 
Bessel Functions with Negative Index 

Since v enters the Bessel equation (7.93) only as v 2 , H^\z) as well as H^l(z) are solutions of this 
equation. As a second order differential equation the Bessel equation has two linearly independent 
solutions. For such solutions g(z), h(z) to be linearly independent, the Wronskian W(g, h) must 
be a non- vanishing function. 

For the Wronskian connected with the Bessel equation (7.93) holds the identity 

W' = ~W (7.141) 

the derivation of which follows the derivation on page 192 for the Wronskian of the radial 
Schrodinger equation. The general solution of (7.141) is 

W(z) = -- . (7.142) 



In case of g{z) = H^i( z ) an d = Hg+i( z ) one can identify the constant c by using the 

leading terms in the expansions (7.140). One obtains 



W(z) = -— , (7.143) 

TTZ 
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(1 2) 

i.e., (z) are, in fact, linearly independent. 

One can then expand 



r(2) 



(7.144) 



The expansion coefficients A, B can be obtained from the asymptotic expansion (7.140). For 
\z\ — > oo the leading terms yield 



1 

Tz 



oo 



This equation can hold only for B = and ^4 = exp[i(£ + \)tt\. We conclude 



Simarly, one can show 



<+3 



^ h-(2) 



^+5 



(7.145) 

(7.146) 
(7.147) 



Spherical Hankel Functions 

In analogy to equations (7.90, 7.97) one defines the spherical Hankel functions 



(7.148) 



Following the arguments provided above (see page 193) the functions h\ ' '{z) are solutions of the 
radial Schrodinger equation of free particles (7.53). According to (7.96, 7.132, 7.148) holds for the 
regular spherical Bessel function 



\[hf\z) + hf\z)\ 



(7.149) 



We want to establish also the relationship between h!f ,2 \z) and the irregular spherical Bessel 
function n e (z) defined in (7.97). From (7.97, 7.132) follows 



n e (z) 



l A¥ 



(-1) 



e+i 



r(2) 



2 V 2z 

According to (7.146, 7.147) this can be written 



ng(z) 



2i 



hf\z) - h?>(z) 



Equations (7.149, 7.151) are equivalent to 



hf\z) = je(z) + in e (z) 

(2) 

K (z) = k{z) - ine(z) . 



(7.150) 



(7.151) 



(7.152) 
(7.153) 
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Asymptotic Behaviour of Spherical Bessel Functions 

We want to derive now the asymptotic behaviour of the spherical Bessel functions h!f ,2 \z), je(z) 
and rif(z). From (7.140) and (7.148) one obtains readily 

The leading term in this expansion, at large \z\, is 

h l )\z) = (T '^ +1 e ±iz . (7.155) 



To determine je(z) and ne(z) we note that for z G R the spherical Hankel functions h!f\z) and 
h £ (z), as given by (7.140, 7.148), are complex conjugates. Hence, it follows 

z € R : #(z) = Re[/i5 1} (z)] , n e //(z) = Imf/if^z)] . (7.156) 



Using 



P - Re 2^ rl ^ e _ r yy 2z ) ~ Z> (2r)!(£-2r)! \4z 2 J 



(7.157) 



i , . , T A {e + r)\ ( i \ 



X V^ 1 / 2 ' (*+2r+l)l /-l\r , > 1 

2z Z^ r =0 (2r+l)!(£-2r-l)! c — 1 (J 153) 

\ I = 

one can derive then from (7.154) the identities 

cos[z - (l+l)f] sin[z - (l + l)f] 

= P<(«) ^2 Qe( z ) (7.159) 

cos[z - (l+l)f] cosjz - £tt/2) 

n t {z) = — 2 q e (z) + pi{z) . (7.160) 

Employing cos[z — (£ + 1)|] . = sin(z — ^7r/2) and sin[z — [£ + 1)|] . = — cos(2; — £7r/2) results in 
the alternative expressions 

. , . sin(z - £tt/2) . . cos(z — iir/2) . . 

3t{z) = z P^ z ) + 2z2 «<(*) ( 7 - 161 ) 

. , sin(z — £n/2) , . cos(z — ^7r/2) , . ,_ 

n*(z) = 2 ^ g/(z) z ■ ^ 7 - 162 ) 

The leading terms in these expansions, at large \z\, are 

■ < ^ sin(z-^7r/2) 

#(z) = (7.163) 

/ \ cos(z — £ir/2) 

n t (z) = ^ '- 1 . (7.164) 
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Expressions for the Spherical bessel Functions je(z) and ne(z) 

The identities (7.161, 7.162) allow one to provide explicit expressions for je(z) and ne(z). One 
obtains for i = 0, 1, 2 

, , sinz 

jo(z) = — (7.165) 

n (z) = (7.166) 

• r \ sinz cosz in ik^ 

Ji(z) = — 7.167 

z z z 

/x cos z sin z ,_ . 

ni(z) = = (7.168) 

z z z 

/ 3 1\ 3 

h(z) = (-3 - "J sin2 " ^2 COSZ ( 7 - 169 ) 

(3 1 \ 3 
-^3 + -) COS;Z ~ ^2 sinz (7.170) 

Recursion Formulas of Spherical Bessel Functions 

The spherical Bessel functions obey the recursion relationships 

9e+i(z) = l - g e (z) - g' e (z) (7.171) 
2£ + 1 

9e+i(z) = — - — g e (z) - ge-i(z) (7.172) 

(1 2) — 

where ge(z) is either of the functions h £ ' (z), je(z) and ne(z). One can combine (7.171, 7.172) to 
obtain the recursion relationship 

( 9t+i(z) \ ( g e (z) \ 

{ ) = Mz) { m ) (7 - 173) 
i ~ -i 

AAz) = I i (7.174) 

z 2 z 

We want to prove these relationships. For this purpose we need to demonstrate only that the 

(1 2) 

relationships hold for gt(z) = ' (z). From the linearity of the relationships (7.171-7.174) and 
from (7.149, 7.151) follows then that the relatiosnhips hold also for ge(z) = je(z) and gt(z) = ne(z). 
To demonstrate that (7.171) holds for g e (z) = hf' 2 \z) we employ (7.124, 7.131, 7.148) and express 

« W . ^'W =-2^| 7 =^ TY) (1)7^^(1-^^. (7.175) 

Using T(£+ 1) = £\, defining a = — 1 and employing f u (z) as defined in (7.114) we can write 

z e 

9t{z) = a ^ f e+ i(z) ■ (7.176) 



202 



Spherically Symmetric Potentials 



The derivative of this expression is 

9\{z) = £ -g e (z) + a ^f' e+1 _(z). (7.177) 

Employing (7.121), i.e., 
yields, together with (7.176) 

9e(z) = ~ z 9t{z) - g e+1 (z) (7.179) 

from which follows (7.171). 

In order to prove (7.172) we differentiate (7.179) 

t t 

9e( z ) = --#9t{z) + -9i(z) - g' e+1 (z) . (7.180) 
Since ge(z) is a solution of the radial Schrodinger equation (7.58) it holds 

g'l{z) = -- z g' e (z) + ^±^g e (z) - ge(z) . (7.181) 

Using this identity to replace the second derivative in (7.180) yields 

P(f + 2) PA- 2 

g> +l (z) = ge (z) - -^^ge(z) + -±-!&z) • (7.182) 

Replacing all first derivatives employing (7.179) leads to (7.172). 

To prove (7.173, 7.174) we start from (7.171). The first component of (7.173), in fact, is equivalent 
to (7.171). The second component of (7.173) is equivalent to (7.182). 



Exercise 7.2.5: Provide a detailed derivation of (7.172). 

Exercise 7.2.6: Employ the recursion relationship (7.173, 7.174) to determine (a) ji(z), ]2(z) 
from jo(z), j'o(z) using (7.165), and (b) n±(z), n^^z) from no(z), n' (z) using (7.166). 



Chapter 8 



Interaction of Charged Particles with 
Electromagnetic Radiation 

In this Section we want to describe how a quantum mechanical particle, e.g., an electron in a 
hydrogen atom, is affected by electromagnetic fields. For this purpose we need to establish a suitable 
description of this field, then state the Hamiltonian which describes the resulting interaction. 

It turns out that the proper description of the electromagnetic field requires a little bit of effort. 
We will describe the electromagnetic field classically. Such description should be sufficient for high 
quantum numbers, i.e., for situations in which the photons absorbed or emitted by the quantum 
system do not alter the energy content of the field. We will later introduce a simple rule which 
allows one to account to some limited degree for the quantum nature of the electromagnetic field, 
i.e., for the existence of discrete photons. 



8.1 Description of the Classical Electromagnetic Field / Separa- 
tion of Longitudinal and Transverse Components 

The aim of the following derivation is to provide a description of the electromagnetic field which is 
most suitable for deriving later a perturbation expansion which yields the effect of electromagnetic 
radiation on a bound charged particle, e.g., on an electron in a hydrogen atom. The problem is that 
the latter electron, or other charged particles, are affected by the Coulomb interaction V(r) which is 
part of the forces which produce the bound state, and are affected by the external electromagnetic 
field. However, both the Coulomb interaction due to charges contributing to binding the particle, 
e.g., the attractive Coulomb force between proton and electron in case of the hydrogen atom, 
and the external electromagnetic field are of electromagnetic origin and, hence, must be described 
consistently. This is achieved in the following derivation. 

The classical electromagnetic field is governed by the Maxwell equations stated already in (1.27- 
1.29). We assume that the system considered is in vacuum in which charge and current sources 
described by the densities p(f, t) and J(f, t) are present. These sources enter the two inhomogeneous 
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Maxwell equations 

V-E(f,t) = 4vrp(f,i) (8.1) 

V x B(r,t) - d t E(r,t) = 4tt J(r,t) . (8.2) 

In addition, the two homogeneous Maxwell equations hold 

V x E(r, t) + d t B(r, t) = (8.3) 

V -B(r,t) = . (8.4) 



Lorentz Force A classical particle with charge q moving in the electromagnetic field experiences 
the so-called Lorentz force q[E(f, t) + v x B(f, t)] and, accordingly, obeys the equation of motion 

j t P = q { E[f (t),t] + v x B[f (t),t] } (8.5) 

where p is the momentum of the particle and r Q {t) it's position at time t. The particle, in turn, 
contributes to the charge density p(f,t) in (8.1) the term qd(r — f (t)) and to the current density 
J(f,t) in (8.2) the term qr 5{f — r {t)). In the non-relativistic limit holds p « mf and (8.5) above 
agrees with the equation of motion as given in (1.25). 



Scalar and Vector Potential Setting 

B(r, t) = V x A(f, t) (8.6) 

for some vector-valued function A(f, t), called the vector potential, solves implicitly (8.4). Equation 
(8.3) reads then 

Vx (E(r,t) + d t A(f,t)^j = (8.7) 

which is solved by 

E(f,t) + d t A(f,t) = -VV(f,t) (8.8) 
where V(f, t) is a scalar function, called the scalar potential. From this follows 

E(r,t) = -W(r,t) - d t A(f,t). (8.9) 



Gauge Transformations We have expressed now the electric and magnetic fields E(f, t) and 
B(f,t) through the scalar and vector potentials V(f, t) and A(r, t). As is well known, the rela- 
tionship between fields and potentials is not unique. The following substitutions, called gauge 
transformations, alter the potentials, but leave the fields unaltered: 

A(r,t) — ► A{f,t) + Vx(r,t) (8.10) 
Vftt) — V(r,t) - d tX (r,t) . (8.11) 

x We assume so-called Gaussian units. The reader is referred to the well-known textbook "Classical Electrody- 
namics", 2nd Edition, by J. D. Jackson (John Wiley & Sons, New York, 1975) for a discussion of these and other 
conventional units. 
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This gauge freedom will be exploited now to introduce potentials which are most suitable for the 
purpose of separating the electromagnetic field into a component arising from the Coulomb potential 
connected with the charge distribution p(r, t) and the current due to moving net charges, and a 
component due to the remaining currents. In fact, the gauge freedom allows us to impose on the 
vector potential A(f, t) the condition 

V • A(r,t) = . (8.12) 

The corresponding gauge is referred to as the Coulomb gauge, a name which is due to the form of 
the resulting scalar potential V(f, t). In fact, this potential results from inserting (8.9) into (8.1) 

V-(-Vy(r,t) - d t A(r,t)^j = 4tt p(f,t) . (8.13) 

Using V • dtA(f, t) = dtV ■ A(r,t) together with (8.12) yields then the Poisson equation 

V 2 V(f,t) = -±irp(f,t) . (8.14) 

In case of the boundary condition 

V(f,t) = for r £ dQ^ (8.15) 
the solution is given by the Coulomb integral 

V(r,t) = [ dV^4 (8.16) 



This is the potential commonly employed in quantum mechanical calculations for the description 
of Coulomb interactions between charged particles. 

The vector potential A(f,t) can be obtained employing (8.2), the second inhomogeneous Maxwell 
equation. Using the expressions (8.6) and (8.9) for the fields results in 

V x (V x A(f,t)^j + d t (W(r,t) + dtA(r,t) = 4vr J(r,t) . (8.17) 

The identity 

V x (V x I(r,t)) = V (v • A(r,t)) ~ V 2 A(f,t) (8.18) 
together with condition (8.12) leads us to 

V 2 A(f,t) - d 2 A(f,t) - d t VV(f,t) = -47rJ(r,t). (8.19) 

Unfortunately, equation (8.19) couples the vector potential A(f,t) and V(f, t). One would prefer 
a description in which the Coulomb potential (8.16) and the vector potential are uncoupled, such 
that the latter describes the electromagnetic radiation, and the former the Coulomb interactions 
in the unperturbed bound particle system. Such description can, in fact, be achieved. For this 
purpose we examine the offending term dtW(f, t) in (8.19) and define 

J e (f,t) = l-d t VV(f,t) . (8.20) 
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For the curl of Je holds 

V x J e (f,t) = . (8.21) 
For the divergence of Je(r,t) holds, using <9(V = Vdt and the Poisson equation (8.14), 

V-J e (f,t) = ±-d t V 2 V(r,t) = -d tP (f,t) (8.22) 

or 

V • J e (r, t) + dtp(r, t) = . (8.23) 

This continuity equation identifies Jg(r, t) as the current due to the time-dependence of the charge 
distribution p(f,t). Let J(r,t) be the total current of the system under investigation and let 
Jt = J — Je- For J also holds the continuity equation 

V- J(f,t) + d t p(f,t) = (8.24) 

and from this follows 

V-J t (f,t) = 0. (8.25) 

Because of properties (8.21) and (8.25) one refers to Je and Jt as the longitudinal and the transverse 
currents, respectively. 

The definitions of Je and Jt applied to (8.19) yield 

V 2 A(r,t) - d?A(r,t) = -4ir J t (?,t) . (8.26) 

This equation does not couple anymore scalar and vector potentials. The vector potential deter- 
mined through (8.26) and (8.12) and the Coulomb potential (8.16) yield finally the electric and 
magnetic fields. V(r, t) contributes solely an electric field component 

E t (f,t) = -VV(f,t) (8.27) 

which is obviously curl-free (V x Ee(f,t) = 0), hence, the name longitudinal electric field. A(f,t) 
contributes an electrical field component as well as the total magnetic field 

E t (f,t) = -d t A(f,t) (8.28) 
B t (f,t) = VxA(f,t) . (8.29) 

These fields are obviously divergence -free (e.g., V • E t (r, t) =0), hence, the name transverse fields. 



8.2 Planar Electromagnetic Waves 

The current density Jt describes ring- type currents in the space under consideration; such current 
densities exist, for example, in a ring-shaped antenna which exhibits no net charge, yet a current. 
Presently, we want to assume that no ring-type currents, i.e., no divergence-free currents, exist in 
the space considered. In this case (8.26) turns into the well-known wave equation 



V 2 A(f,t) - d 2 A(r,t) = 



(8.30) 
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which describes electromagnetic fields in vacuum. A complete set of solutions is given by the 
so-called plane waves 

A(f,t) = Aoiiexp i(k-r^ojt) (8.31) 
where the dispersion relationship 

\k\ = lo (8.32) 

holds. Note that in the units chosen the velocity of light is c = 1. Here the "-" sign corresponds 
to so-called incoming waves and the "+" sign to outgoing waves 2 , the constant k is referred to as 
the wave vector. The Coulomb gauge condition (8.12) yields 

u-k = . (8.33) 

u is a unit vector (\u\ = 1) which, obviously, is orthogonal to k; accordingly, there exist two linearly 
independent orientations for u corresponding to two independent planes of polarization. 
We want to characterize now the radiation field connected with the plane wave solutions (8.31). 
The corresponding electric and magnetic fields, according to (8.28, 8.29), are 

E t (f,t) = ±iuA(r,t) (8.34) 
B t (r,t) = ikxA(r,t) . (8.35) 

The vector potential in (8.31) and the resulting fields (8.34, 8.35) are complex-valued quantities. 
In applying the potential and fields to physical observables and processes we will only employ the 
real parts. 

Obviously, Et(r,t) and Bt(f,t) in (8.34, 8.35), at each point f and moment t, are orthogonal to 
each other and are both orthogonal to the wave vector k. The latter vector describes the direction 
of propagation of the energy flux connected with the plane wave electromagnetic radiation. This 
flux is given by 

S{f,t) = —ReE t (f,t)xReB(f,t). (8.36) 
Air 

Using the identity a x (b x c) = b(a ■ c) — c(a- b) and (8.31, 8.32, 8.34, 8.35) one obtains 

S(f,t) = ±—\A \ 2 k sin 2 (k-f-ujt) (8.37) 
Air 

where k is the unit vector k = k/\k\. Time average over one period 2tt/oj yields 

(S(r,t)} = ±^-\ Ao \ 2 k. (8.38) 

In this expression for the energy flux one can interprete k as the propagation velocity (note c = 1) 
and, hence, 

{e) = ^ \A \ 2 (8.39) 



2 The definition incoming waves and outgoing waves is rationalized below in the discussion following Eq. (8.158); 
see also the comment below Eqs. (8.38, 8.39). 
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as the energy density. The sign in (8.38) implies that for incoming waves, defined below 
Eqs. (8.31,8.32), the energy of the plane wave is transported in the direction of —k, whereas in 
the case of outgoing waves the energy is transported in the direction of k. 

A correct description of the electromagnetic field requires that the field be quantized. A 'poor 
man's' quantization of the field is possible at this point by expressing the energy density (8.39) 
through the density of photons connected with the planar waves (8.31). These photons each carry 
the energy Hto. If we consider a volume V with a number of photons M u the energy density is 
obviously 

w = ^ ■ («») 

It should be pointed out that AC represents the number of photons for a specific frequency lo, a 
specific k and a specific u. Comparision of (8.39) and (8.40) allows one to express then the field 
amplitudes 



A 



8ttMJi 
loV 



3.41) 



Inserting this into (8.31) allows one finally to state for the planar wave vector potential 



A(f,t) 



8vrA/> 



uexp 



i(k • f — ut) 



\k\ = co , u ■ k = 



(8.42) 



8.3 Hamilton Operator 

The classical Hamiltonian for a particle of charge q in a scalar and vector potential V(r) and A(f, t), 
respectively, is 



H 



p - qA(f,t) 



2 m 



+ qV(r) 



(8.43) 



Here the fields are defined through Eqs. (8.27, 8.28, 8.29) together with the potentials (8.16, 8.31). 
The integrals express the integration over the energy density of the fields. Note that E^(f, t) is real 
and that E t (f, t), B t (f, t) are complex leading to the difference of a factor \ in the energy densities 
of the lontitudinal and transverse components of the fields. 

We assume that the energy content of the fields is not altered significantly in the processes described 
and, hence, we will neglect the respective terms in the Hamiltonian (8.43). We are left with a 
classical Hamiltonian function which has an obvious quantum mechanical analogue 



H 



p- qA(f,t) 



2 m 



+ qV(f) 



(8.44) 



replacing the classical momentum p by the differential operator p 
of the particle is then described by the Schrodinger equation 



V. The wave function ^(r, t) 



ihd t V(r,t) = HV(r,t) 



(8.45) 
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Gauge Transformations It is interesting to note that in the quantum mechanical description 
of a charged particle the potentials V(f, t) and A(f, t) enter whereas in the classical equations of 
motion 

mf = qE(f,t) + qf x B(f,t) (8.46) 

the fields enter. This leads to the question in how far the gauge transformations (8.10, 8.11) affect 
the quantum mechanical description. In the classical case such question is mute since the gauge 
transformations do not alter the fields and, hence, have no effect on the motion of the particle 
described by (8.46). 

Applying the gauge transformations (8.10, 8.11) to (8.44, 8.45) leads to the Schrodinger equation 



ihd t fy(f,t) 



p - qA- g((Vx)) 



2m 



+ qV - q{{d tX )) 



*(r, t) 



(8.47) 



where ((• • •)) denotes derivatives in ((Vx)) and {{dtx)) which are confined to the function xif^) 
inside the double brackets. One can show that (8.47) is equivalent to 



ihd t e iqx{r ^ )/h *(f,t) 



p — qA 



2m 



+ qV 



(8.48) 



For this purpose one notes 

ihd t e*<?x(^)/%(r, t) = e iqx ^l h [ihd t - q((dtx))] *(r,t) 



Jqx(r,t)/h 



p + g((Vx)) 



V(r,t) 



(8.49) 
(8.50) 



The equivalence of (8.47, 8.48) implies that the gauge transformation (8.10, 8.11) of the potentials 
is equivalent to multiplying the wave function \l/(r, t) by a local and time-dependent phase factor 
e mx(r,t)/h_ Obviously, such phase factor does not change the probability density |*(r, t)\ 2 and, 
hence, does not change expectation values which contain the probability densities 3 . 
An important conceptual step of modern physics has been to turn the derivation given around and 
to state that introduction of a local phase factor e %qx ^ r,t ^ h should not affect a system and that, 
accordingly, in the Schrodinger equation 



ihd t ^{f, t) 



p — qA 



2m 



+ qV 



^(f,t) 



5.51) 



the potentials A(f, t) and V(f, t) are necessary to compensate terms which arise through the phase 
factor. It should be noted, however, that this principle applies only to fundamental interactions, 
not to phenomenological interactions like the molecular van der Waals interaction. 
The idea just stated can be generalized by noting that multiplication by a phase factor e tqx ^ r ' t ^ h 
constitutes a unitary transformation of a scalar quantity, i.e., an element of the group U(l). Ele- 
mentary constituents of matter which are governed by other symmetry groups, e.g., by the group 



3 The effect on other expectation values is not discussed here. 
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SU(2), likewise can demand the existence of fields which compensate local transformations de- 
scribed by e ia '^ r,t ^ where a is the vector of Pauli matrices, the generators of SU(2). The resulting 
fields are called Yang- Mills fields. 
The Hamiltonian (8.44) can be expanded 



- 2 2 
P " ' 



H 

2m 2m 



(p-A + A-p} + ^-A 2 + qV (8.52) 



For any function f(r) holds 

($.A- A-t>)f{r) = - { (I-V/ + fV-A - A-Vf) = -J^-A. (8.53) 

This expression vanishes in the present case since since V • A = [cf. (8.12)]. Accordingly, holds 

0-Af = A-pf (8.54) 

and, consequently, 

- 2 2 

H = I 3_p. a + ^—A 2 + qV . (8.55) 

2m m 2m 

8.4 Electron in a Stationary Homogeneous Magnetic Field 

We consider now the motion of an electron with charge q = — e and mass m = m e in a homogeneous 
magnetic field as described by the Schrodinger equation (8.45) with Hamiltonian (8.55). In this 
case holds V(r, t) = 0. The stationary homogeneous magnetic field 

B(r, t) = B , (8.56) 

due to the gauge freedom, can be described by various vector potentials. The choice of a vector 
potential affects the form of the wave functions describing the eigenstates and, thereby, affects the 
complexity of the mathematical derivation of the wave functions. 

Solution for Landau Gauge A particularly convenient form for the Hamiltonian results for 
a choice of a so-called Landau gauge for the vector potential A(f,t). In case of a homogeneous 
potential pointing in the 23-direction, e.g., for B Q = B es in (8.56), the so-called Landau gauge 
associates the vector potential 

A L (r) = B oXl e 2 (8.57) 

with a homogeneous magnetic field B Q . The vector potential (8.57) satisfies V • A = and, 
therefore, one can employ the Hamiltonian (8.55). Using Cartesian coordinates this yields 

H = ~{dl + dl + dl) + E -^ Xl d 2 + ^xj (8.58) 
2m ft " im e 2m e 



where dj = (d/dxj), j = 1,2,3. 

We want to describe the stationary states corresponding to the Hamiltonian (8.58). For this purpose 
we use the wave function in the form 



k 2 ,h;xi,X2,xs) = exp(ifc 2 x 2 + ifoxs) (f> E (xi) 



(8.59) 
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This results in a stationary Schrodinger equation 

^2 fc2/„2 k 2 U 2 



2m. 



/i /bo ^1 /bo 

<9{ + 7; — + 



2m f 



2m P 



+ 



eB hk2 e 2 B 2 2 
xi + — x 



2m f 



Completing the square 

e 2 B 2 



2m e 



leads to 



where 



and where 



2m, 



Of 



+ 



1 



eB Q hk2 



2f?2 



xi 



2m f 



= E(j> E {x{) 



h 2 k 2 

2m e 



m e u? (xi + xi )^ 



2m P 



^(xi) = E(f) E (xi) 



Xlo 



oj = 



hk 2 

~e~B~o 

eBo 

m fi 



(8.60) 

(8.61) 

(8.62) 
(8.63) 
(8.64) 



is the classical Larmor frequency (c = 1). It is important to note that the completion of the square 
absorbs the kinetic energy term of the motion in the ^-direction described by the factor exp(ik 2 x 2 ) 
of wave function (8.59). 

The stationary Schrodinger equation (8.62) is that of a displaced (by x\ ) harmonic oscillator with 
shifted (by h 2 k 2 /2m e ) energies. From this observation one can immediately conclude that the wave 
function of the system, according to (8.59), is 



fy(n,k 2 ,fo;xi,X2,x 3 ) 



i 



V2 n n! 



*g \ * exp 



m fi uj(x 1 +xi ) 2 

2h 



exp(ik 2 x 2 + ik^xz) x 

H n (y^W(x l + X lo )) 



(8.65) 



where we replaced the parameter E by the integer n, the familiar harmonic oscillator quantum 
number. The energies corresponding to these states are 



E(n,k 2 ,k 3 ) 



+ 2 )+ 2mZ 



(8.66) 



Obviously, the states are degenerate in the quantum number k 2 describing displacement along the 
x 2 coordinate. Without affecting the energy one can form wave packets in terms of the solutions 
(8.65) which localize the electrons. However, according to (8.63) this induces a spread of the wave 
function in the xi direction. 



Solution for Symmetric Gauge The solution obtained above has the advantage that the deriva- 
tion is comparatively simple. Unfortunately, the wave function (8.65), like the corresponding gauge 
(8.57), is not symmetric in the X\- and X2-coordinates. We want to employ , therefore, the so-called 
symmetric gauge which expresses the homogeneous potential (8.56) through the vector potential 



A(r) = -B xf. 



(8.67) 
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One can readily verify that this vector potential satisfies the condition (8.12) for the Coulomb 
gauge. 

For the vector potential (8.67) one can write 

p- A = j. V • B x f . (8.68) 

Using V • (m x t?) = — u ■ V x v + v ■ V x u yields, in the present case of constant B Q , for any 
function f(f) 

P-Af = -B -i>xrf . (8.69) 
The latter can be rewritten, using V x (uf) = —u x V/ + /V x u and V x r = 0, 

B -pxrf = B -(fxp) f . (8.70) 
Identifying r x p with the angular momentum operator L, the Hamiltonian (8.52) becomes 

-_2 2 2 

Of particular interest is the contribution 



y mag = ^—L-B (8.72) 
2m P 



to Hamiltonian (8.71). The theory of classical electromagnetism predicts an analogue energy con- 
tribution , namely, 

Knag = — A*class ' B Q (8.73) 

where /U c i ass is the magnetic moment connected with a current density j 

/"class = ^ J rxj(f) dr (8.74) 

We consider a simple case to relate (8.72) and (8.73, 8.74), namely, an electron moving in the 
x, y-plane with constant velocity v on a ring of radius r. In this case the current density measures 
— ev oriented tangentially to the ring. Accordingly, the magnetic moment (8.74) is in the present 
case 

Mclass = - -erv e 3 . (8.75) 
The latter can be related to the angular momentum £ c i ass = r S3 of the electron 

A'class = 7; ^class (8.76) 

and, accordingly, 

€■-*-* 

Knag = 4lass ' B Q . (8.77) 

Comparision with (8.72) allows one to interpret 
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as the quantum mechanical magnetic moment operator for the electron (charge — e). 
We will demonstrate in Sect. 10 that the spin of the electron, described by the operator S, likewise, 
gives rise to an energy contribution (8.72) with an associated magnetic moment — g S where 
g « 2. A derivation of his property and the value of g, the so-called gyromagnetic ratio of the 
electron, requires a Lorentz-invariant quantum mechanical description as provided in Sect. 10. 
For a magnetic field (8.56) pointing in the ^-direction the symmetric gauge (8.67) yields a more 
symmetric solution which decays to zero along both the ±x±- and the indirection. In this case, 
i.e., for B Q = B Q es, the Hamiltonian (8.71) is 

To obtain the stationary states, i.e, the solutions of 

HV E (xi,x 2 ,x 3 ) = EV E (x 1 ,x 2 ,x 3 ) , (8.80) 
we separate the variable x\, x 2 from x 3 setting 

*e(xi,x 2 ,x 3 ) = exp(i/c 3 x 3 ) tp(xi,x 2 ) ■ (8.81) 
The functions i(j(xi,x 2 ) obey then 

H ^( Xl ,x 2 ) = E'^{xi,x 2 ) (8.82) 

where 

h 2 1 1 

H Q = -- — (8 2 + dl) + -m e co 2 (xj + x 2 2 ) + hw-{ Xl d 2 - x 2 d x ) (8.83) 

h 2 k 2 

E' = E — — — ^ . (8.84) 
2m e v ' 

We have used here the expression for the angular momentum operator 

L 3 = (h/i)( Xl d 2 -x 2 9i) . (8.85) 

The Hamiltonian (8.83) describes two identical oscillators along the xi-and ^-directions which 
are coupled through the angular momentum operator L 3 . Accordingly, we seek stationary states 
which are simultaneous eigenstates of the Hamiltonian of the two-dimensional isotropic harmonic 
oscillator 

H osc = (df + 8l) + l -m e uj 2 (xj + x 2 ) (8.86) 

as well as of the angular momentum operator L 3 . To obtain these eigenstates we introduce the 
customary dimensionless variables of the harmonic oscillator 



X 3 = J-t-Xj ,j = l,2. (8.87) 



(8.83) can then be expressed 
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Employing the creation and annihilation operators 

4 - {*> ~ tk) ' = T2 ( Xi + 4) ;i = 1 ' 2 (8 ' 89) 

and the identity 

<jj L3 = - (^a\a 2 — a\a\ ^ , (8.90) 
which can readily be proven, one obtains 

— H = a\a\ + a\a 2 + 1 + - ^aja 2 — <4°i ) ■ (8.91) 

We note that the operator a\a 2 — a\a\ leaves the total number of vibrational quanta invariant, 
since one phonon is annihilated and one created. We, therefore, attempt to express eigenstates in 
terms of vibrational wave functions 

V(j,m;x 1 ,x 2 ) = \ / -)=4=^(0,0;ai,X2) (8.92) 

where ^(0, 0; x±, x 2 ) is the wave function for the state with zero vibrational quanta for the X\- as 
well as for the X2-oscillator. (8.92) represents a state with j + m quanta in the xi-oscillator and 
j — m quanta in the X2-oscillator, the total vibrational energy being Hu(2j + 1). In order to cover 
all posible vibrational quantum numbers one needs to choose j, m as follows: 

i = 0, -, 1, , m = -j, + . (8.93) 

The states (8.92) are not eigenstates of L3. Such eigenstates can be expressed, however, through a 
combination of states 

j 

V'(j,m';x 1 ,x 2 ) = a m!n'*0'>ra;si>32) • (8.94) 

m=-j 

Since this state is a linear combination of states which all have vibrational energy (2j + 1)Hlu, (8.94) 
is an eigenstate of the vibrational Hamiltonian, i.e., it holds 

(a}ai + a\a 2 + 1 ) *'(j, m'; x u x 2 ) = ( 2 j + 1 ) m'; Xi, x 2 ) . (8.95) 

We want to choose the coefficients a>^ m , such that (8.94) is also an eigenstate of L3, i.e., such that 

- (^a\a 2 — a\ai ^ ^'(j,m';xi,x 2 ) = 2m' ^'(j, m'; x±, x 2 ) (8.96) 

holds. If this property is, in fact, obeyed, (8.94) is an eigenstate of H Q 

4*'(j>';n,i2) = fiw (2j + 2m' + 1) m'; xi, x 2 ) . (8.97) 
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(7) 

In order to obtain coefficients or^ m , we can profitably employ the construction of angular momentum 
states in terms of spin-^ states as presented in Sects. 5.9,5.10,5.11. If we identify 

a\, ai, a\,d2 < — ► b + , b + , b_, (8.98) 

V v ' * v ' 

present notation notation in Sects. 5.9,5.10,5.11 

then the states ^(j, m; xi, X2) defined in (8.92) correspond to the eigenstates \^f(j,m)) in Sect. 5.9. 
According to the derivation given there, the states are eigenstates of the operator [we use for the 
operator the notation of Sect. 5.10, cf. Eq. (5.288)] 

^3 = g ( a i°i " °2°2) (8-99) 

with eigenvalue m. The connection with the present problem arises due to the fact that the operator 
J2 in Sect. 5.10, which corresponds there to the angular momentum in the ^-direction, is in the 
notation of the present section 

' h = ki H" 2 " a 2 ai ) ' (8 - 100) 

i.e., except for a factor Jj, is identical to the operator L3 introduced in (8.84) above. This implies that 
we can obtain eigenstates of L3 by rotation of the states m; x±, X2)- The required rotation must 
transform the X3-axis into the X2~axis. According to Sect. 5.11 such transformation is provided 
through 

y'(j,m';x 1 ,x 2 ) = D% ml (^,^,0)V(j,m;x 1 ,x 2 ) (8.101) 

where £^m'(f ' f ^ s a rotation matrix which describes the rotation around the X3~axis by | 
and then around the new X2~axis by |, i.e., a transformation moving the X3-axis into the X2- 

axis. The first rotation contributes a factor exp(— zm|), the second rotation a factor d^m'd )> ^ ne 
latter representing the Wigner rotation matrix of Sect. 5.11. Using the explicit form of the Wigner 
rotation matrix as given in (5.309) yields finally 



( J+J-t) ( ^t^ ) (-V j - m '- t (-i) m nj,rn;x 1 ,X2) . (8.102) 
We have identified, thus, the eigenstates of (8.83) and confirmed the eigenvalues stated in (8.97). 



8.5 Time-Dependent Perturbation Theory 

We want to consider now a quantum system involving a charged particle in a bound state perturbed 
by an external radiation field described through the Hamiltonian (8.55). We assume that the 
scalar potential V in (8.55) confines the particle to stationary bound states; an example is the 
Coulomb potential V(r,t) = 1/Airr confining an electron with energy E < to move in the well 
known orbitals of the hydrogen atom. The external radiation field is accounted for by the vector 
potential A(f, t) introduced above. In the simplest case the radiation field consists of a single 
planar electromagnetic wave described through the potential (8.31). Other radiation fields can 
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be expanded through Fourier analysis in terms of such plane waves. We will see below that the 
perturbation resulting from a 'pure' plane wave radiation field will serve us to describe also the 
perturbation resulting from a radiation field made up of a superposition of many planar waves. 
The Hamiltonian of the particle in the radiation field is then described through the Hamiltonian 

H = H + V s (8.103) 

H = ^ + qV ^ (8.104) 

V s = -±p-A{r,t) + ^A 2 (f,t) (8.105) 
m 2m 

where A(f, t) is given by (8.42). Here the so-called unperturbed system is governed by the Hamil- 
tonian H Q with stationary states defined through the eigenvalue problem 

H Q \n) = e n \n) , n = 0, 1, 2... (8.106) 

where we adopted the Dirac notation for the states of the quantum system. The states |n) are 
thought to form a complete, orthonormal basis, i.e., we assume 

(n\m) = 5 nm (8.107) 

and for the identity 

oo 

]1 = ^|n)(n| . (8.108) 

?1=0 

We assume for the sake of simplicity that the eigenstates of H Q can be labeled through integers, 
i.e., we discount the possibility of a continuum of eigenstates. However, this assumption can be 
waved as our results below will not depend on it. 



Estimate of the Magnitude of Vs 

We want to demonstrate now that the interaction Vs(t), as given in (8.105) for the case of radiation- 
induced transitions in atomic systems, can be considered a weak perturbation. In fact, one can 
estimate that the perturbation, in this case, is much smaller than the eigenvalue differences near 
typical atomic bound states, and that the first term in (8.105), i.e., the term ~ p- A(f,t), is much 
larger than the second term, i.e., the term ~ A 2 (f, t). This result will allow us to neglect the 
second term in (8.105) in further calculations and to expand the wave function in terms of powers 
of Vs(t) in a perturbation calculation. 

For an electron charge q = — e and an electron mass m = m e one can provide the estimate for the 
first term of (8.105) as follows 4 . We first note, using (8.41) 



— p ■ A 

m e 

4 The reader should note that the estimates are very crude since we are establishing an order of magnitude estimate 
only. 



2m P 



P 

2m P 



8ttA/> 
wV 



.109) 
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The virial theorem for the Coulomb problem provides the estimate for the case of a hydrogen atom 



2m f 



le 2 
2 a Q 



(8.110) 



where a Q is the Bohr radius. Assuming a single photon, i.e., AC = 1, a volume V = A 3 where A is 
the wave length corresponding to a plane wave with frequency to, i.e., A = 2irc/\, one obtains for 
(8.109) using V = A4vr 2 c 2 /w 2 



— p • A 

m e 


e 2 


2 




47ra 


7T 


A m e c 2 



(8.111) 



For ftu; = 3 eV and a corresponding A = 4000 A one obtains, with a Q « 0.5 A, and m e c 2 
500 keV 

2 a D foj 



it A m P c 2 



10" 



and with e 2 /a Q « 27 eV, altogether, 



e - r 
— p • t4 



- 10 eV • 10" 4 = 10" 3 eV . 



(8.112) 



(8.113) 



This magnitude is much less than the differences of the typical eigenvalues of the lowest states of 
the hydrogen atom which are of the order of 1 eV. Hence, the first term in (8.105) for radiation 
fields can be considered a small perturbation. 

We want to estimate now the second term in (8.105). Using again (8.41) one can state 



p 2 

e A 2 



2m e 



e 2 1 %-KMujhjj 



2m P to 2 V 



For the same assumptions as above one obtains 



p 2 

6 A 2 



2m e 



8ira 



a 4:Huj 



A m P c 2 



Employing for the second factor the estimate as stated in (8.112) yields 



2m P 



10 eV • 10" 



10~ 7 eV 



(8.114) 



U15) 



(8.116) 



This term is obviously much smaller than the first term. Consequently, one can neglect this term as 
long as the first term gives non- vanishing contributions, and as long as the photon densities M^/V 
are small. We can, hence, replace the perturbation (8.105) due to a radiation field by 



V S 



m 



(8.117) 



In case that such perturbation acts on an electron and is due to superpositions of planar waves 
described through the vector potential (8.42) it holds 



e 
m 



E 



k,u 



kV 



a(k,u) p- u exp i(k ■ r — ut) 



(8.118) 
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where we have replaced u in (8.42) through k = \k\ = u. The sum runs over all possible k vectors 
and might actually be an integral, the sum over u involves the two possible polarizations of planar 
electromagnetic waves. A factor a(k, u) has been added to describe eliptically or circularly polarized 
waves. Equation (8.118) is the form of the perturbation which, under ordinary circumstances, 
describes the effect of a radiation field on an electron system and which will be assumed below to 
describe radiative transitions. 



Perturbation Expansion 

The generic situation we attempt to describe entails a particle at time t = t Q in a state |0) and 
a radiation field beginning to act at t = t on the particle promoting it into some of the other 

states |n), n = 1, 2, The states |0), \n) are defined in (8.106-8.108) as the eigenstates of the 

unperturbed Hamiltonian H a . One seeks to predict the probability to observe the particle in one 
of the states |n), n ^ at some later time t > t Q . For this purpose one needs to determine the 
state |\Pg(i)) of the particle. This state obeys the Schrodinger equation 



ihdt\* s (t)) = [H + V s (t)]\*s(t)) 
subject to the initial condition 

\*s(t )) = |0> . 

The probability to find the particle in the state |n) at time t is then 

PO^n(t) = IH*s(t)>| 2 • 



(8.119) 
(8.120) 

(8.121) 



In order to determine the wave function we choose the so-called Dirac representation defined 

through 



!**(*)> 



exp 



h 



H {t - to) 



\*D{t)) 



where 
Using 

ih dtexp 
and (8.119) one obtains 



\*D(t )) = |0) • 



H Q {t - t ) 



H exp 



H 



H Q {t - t ) 



(8.122) 
(8.123) 
(8.124) 



exp 



h 



H Q {t - to) 



(Ho + ihd t ) \* D (t)) 



= [Ho + V s (t)] exp 



\*D(t)) 



from which follows 



exp 



h 



H a (t - Q 



ihdt\V D (t)) = V s (t)exp 



-^H (t - t ) 



\*D{t)) 



Multiplying the latter equation by the operator exp [| H (t — t Q )] yields finally 

m\*D(t)) = !*£»(*)) , = io) 



(8.125) 



(8.126) 



.127) 
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where 



V D (t) 



exp 



H Q {t - t ) 



V s (t)exp 



n 



H Q (t - Q 



We note that the transition probability (8.121) expressed in terms of *!'£>(£)) is 



Po-mO) = |(n|exp 



h 



H Q {t - t ) 



l*D(i))| 2 - 



(8.128) 



(8.129) 



Due to the Hermitean property of the Hamiltonian H Q holds (n\H = e n {n\ and, consequently, 



(n|exp 



j.H (t - Q 



exp 



--e„(t - t ) 



(n\ 



(8.130) 



(8.131) 



(8.132) 



from which we conclude, using |exp[— |e n (t — t Q )]\ = 1, 

Po-n(t) = \(n\V D (t))\ 2 ■ 
In order to determine \^£)(t)) described through (8.127) we assume the expansion 

oo 

\9 D (t)) = X>S>(t)> 

n=0 

where \^Sf^\t)} accounts for the contribution due to n-fold products of Vo(t) to | Accord- 
ingly, we define \typ\t)) through the evolution equations 

ihd t \^\t)) = o 



m\^\t)) 



V D (t)\^\t)) 
V D (t)\^\t)) 



(8.133) 
(8.134) 
(8.135) 



ihd t \^\t)) 



(8.136) 



together with the initial conditions 



|0) for n = 

for n = 1, 2. . . . 



(8.137) 



One can readily verify that (8.132-8.137) are consistent with (8.127, 8.128). 

Equations (8.133-8.137) can be solved recursively. We will consider here only the two leading 
contributions to From (8.133, 8.137) follows 



I *!?(*)> = |0> • 



Employing this result one obtains for (8.134, 8.137) 



(8.138) 



(8.139) 
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This result, in turn, yields for (8.135, 8.137) 

2 ft 



l*D (*)) = (^) l dt 'j t dt"V D (t')V D (t")\0) . (8.140) 
Altogether we have provided the formal expansion for the transition amplitude 

(n\* D (t)) = (n\0) + ^ f dt' (n\V D (t')\0) (8.141) 

+ V k / dt' dt"(n\V D (t')\m)(m\V D (t")\0) + ... 

m=0 V 1 "/ ■'*» ^ 

8.6 Perturbations due to Electromagnetic Radiation 

We had identified in Eq. (8.118) above that the effect of a radiation field on an electronic system 
is accounted for by perturbations with a so-called harmonic time dependence ~ exp(— iujt). We 
want to apply now the perturbation expansion derived to such perturbations. For the sake of 
including the effect of superpositions of plane waves we will assume, however, that two planar 
waves simulataneously interact with an electronic system, such that the combined radiation field 
is decribed by the vector potential 

A(f, t) = Ai ii\ exp i {k\ ■ r — <jj\t) incoming wave (8.142) 



+ A 2 u 2 exp i (k 2 • r =F ^2 1) 



incoming or outgoing wave 



combining an incoming and an incoming or outgoing wave. The coefficients A±, A 2 are defined 
through (8.41). 

The resulting perturbation on an electron system, according to (8.118), is 

V s = Vi exp(-iwit) + V 2 exp(=R^2*) e xt , A -> Of , t -> -oo (8.143) 
where V\ and V 2 are time-independent operators defined as 

Uj e lk ' r . (8.144) 



e /8vrA^/i ^ n , r 




II in 

Here the factor I describes the strength of the radiation field (for the specified planar wave) as 
determined through the photon density Mj/V and the factor II describes the polarization of the 
planar wave; note that itj, according to (8.34, 8.142), defines the direction of the -E-field of the 
radiation. The factor III in (8.144) describes the propagation of the planar wave, the direction of 
the propagation being determined by A; = k/\k\. We will demonstrate below that the the sign of 
=Ficji determines if the energy of the planar wave is absorbed ( "-" sign) or emitted ( "+" sign) by 
the quantum system. In (8.144) r is the position of the electron and p = (h/i)V is the momentum 
operator of the electron. A factor exp(At), A — > 0+ has been introduced which describes that at 
time t — ► — oo the perturbation is turned on gradually. This factor will serve mainly the purpose of 
keeping in the following derivation all mathematical quantities properly behaved, i.e., non-singular. 
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1st Order Processes 



We employ now the perturbation (8.143) to the expansion (8.141). For the 1st order contribution 
to the transition amplitude 



j t 



we obtain then, using (8.128), (8.130) and (for m = 0) 



exp 



n 



H Q {t - t ) 



\m) = exp 



h 



£m{t to) 



\m) , 



(8.145) 



(8.146) 



(e n - e c - ihX) t' 



for (8.145) 

1 I' 1 

(n|* ( nVt)) = lim lim — / dt' exp 

x (<n|Vi|0) e-' lWlt ' + (n\V 2 \0) e ^ 2 *' ) . 
Carrying out the time integration and taking the limit lim^-oo yields 



(8.147) 



H*g>(t)> 



lim e xt 



<n|Vi|0> 
-H (n\V 2 \0) 



exp [|(e„ - e D - /kji) t] 
e Q + — e n + iA/i 

exp [|(e n - e D T ^2) * 



+ 



(8.148) 



2nd Order Processes 



We consider now the 2nd order contribution to the transition amplitude. According to (8.140, 
8.141) this is 



H*g ) (t)) = -^V / dt' dt" (n\V D (t') \m) (m\V D (t") |0) . (8.149) 
n m =o Jt o J to 



Using the definition of Vd stated in (8.128) one obtains 

<fc|Vu(*)|^> = (fc|Vs(t)|*> exp 
and, employing the perturbation (8.143), yields 

(nltfgty)) = - to lim lim Y dt' dt" 

{n\V\ \m) (m\V\ |0) exp — (e n — e m — ftcji — ihX)t f 

+ (n|V2 |m) (m|V2 |0) exp - (e„ - e m =F ^2 - ih\)t' 



h 



(efe - e*) 



(8.150) 



(8.151) 



exp 
exp 



(e m — e G — Tit^i — ih\)t" 
(e m - e Q =F ^2 - iftA)i" 
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+ (n\Vi \m) (m\V 2 |0) exp 
+ (n\V 2 \m) (m\Vi |0) exp 



(Cn — Cm — fa^l — ih\)t' exp 

(e n - e m T ^2 — i7L\)i' exp 
Carrying out the integrations and the limit lim t _ > _ 00 provides the result 

j oo 

—j lim ^ 

ft 2 A^o+ ^ 



(e m — e c =F ^2 - ih\)t" 
(e m — e Q — ZioJi — ih\)t" 



(8.152) 



m=0 



+ 
+ 
+ 



{n\Vi \m) {m\Vi |0> exp[| (e„ - e G - 2fiwi - 2^A)t] 
6 m — e c — hwi — ih\ e„ — e G — 2/kJi — 2i^A 

(n|Vs |m) (m|F 2 |0) exp[i (e n - e D T 2/iw 2 - 2i/iA)t] 



e m — e D =F f^>2 — ihX e n — e Q =F 2^2 — 2iftA 
{n\Vi \m) (m\V 2 |0) exp [J (e n - e - faji T ^2 - 2^A)t] 

e m — e D =F ^2 — i^A e ra — e D — hw\ =F ^2 — 2ih\ 
(n\V 2 \m) (m\Vi |0) exp[| (e n — e — ftu;i T ^2 — 2i/iA)t] 

e m — e D — fe^i — i/iA 



e« — Co — fa^i T ^2 — 2ih\ 



1st Order Radiative Transitions 

The 1st and 2nd order transition amplitudes (8.148) and (8.152), respectively, provide now the 
transition probability po^ n (t) according to Eq. (8.131). We assume first that the first order transi- 
tion amplitude (n\^^\t)) is non-zero, in which case one can expect that it is larger than the 2nd 

order contribution (n\^^\t)) which we will neglect. We also assume for the final state n / such 
that (n|0) = and 

p^ n {t) = \(n\*%\t))\ 2 



holds. Using (8.148) and 



\zi + z 2 \ 2 = \zi\ 2 + \z 2 \ 2 + 2Re{z lZ2 ) 



(8.153) 



.154) 



yields 



lim e 2Xt 



+ 2 Re 



(n\Vi\0)\ 2 
^ (e + huJi- e n ) 2 + (Aft) 2 

, H^|o)| 2 

(e G ± Hun - e n ) 2 + (Aft) 2 
(#)(#) exp[|(±^ 2 - fu^yt] 
(e G + hui — e n + iXh) (e ± hio 2 — e n — iXh) 



(8.155) 



We are actually interested in the transition rate, i.e., the time derivative of po-m(i)- For this rate 
holds 
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d 
dt 



2\t 2A (n|Vi|0)| 2 

a^o+ (e G + hwi - e n y + (Xhy 



+ 



2 A (n\V 2 \0)\< 



(8.156) 



x 2Rc 



{e ±nw 1 -e n y + (\h) 2 + { 2X + dt 

(n|Vi|0)(0|y 2 |n) exp[^(±hu} 2 - hujt] 
(e + Hloi - e n + iXH) (e Q ± fttJ2 - e n — iXK) 



The period of electromagnetic radiation absorbed by electronic systems in atoms is of the order 
1CT 17 s, i.e., is much shorter than could be resolved in any observation; in fact, any attempt to do 
so, due to the uncertainty relationship would introduce a considerable perturbation to the system. 
The time average will be denoted by ( • • • )t- Hence, one should average the rate over many periods 
of the radiation. The result of such average is, however, to cancel the third term in (8.156) such 
that the 1st order contributions of the two planar waves of the perturbation simply add. For the 
resulting expression the limit lim,\->o+ can be taken. Using 



lim — - 



7T S(x) 



(8.157) 



one can conclude for the average transition rate 



(n|Vi|0)| 2 5(e n - to - Hud 



+ |(n|y 2 |0)| 2 ,5(e n - e T ^2) 



(8.158) 



Obviously, the two terms apearing on the rhs. of this expression describe the individual effects of 
the two planar wave contributions of the perturbation (8.142-8.144). The (5-functions appearing in 
this expression reflect energy conservation: the incoming plane wave contribution of (8.143, 8.144), 
due to the vector potential 



Li u\ exp 



i(k 



1 • r 



Ul t) 



(8.159) 



leads to final states |n) with energy e n — e c + f\u\. The second contribution to (8.158), describing 
either an incoming or an outgoing plane wave due to the vector potential 



A 2 u 2 exp i (hi ■ f =F ^2 *) 



.160) 



leads to final states \n) with energy e n — e D i hL0 2 - The result supports our definition of incoming 
and outgoing waves in (8.31) and (8.142) 

The matrix elements (n|Vi|0) and (n|V2|0) in (8.158) play an essential role for the transition rates 
of radiative transitions. First, these matrix elements determine the so-called selection rules for the 
transition: the matrix elements vanish for many states |n) and |0) on the ground of symmetry and 
geometrical properties. In case the matrix elements are non-zero, the matrix elements can vary 
strongly for different states |n) of the system, a property, which is observed through the so-called 
spectral intensities of transitions |0) — > |n). 
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2nd Order Radiative Transitions 

We now consider situations where the first order transition amplitude in (8.153) vanishes such that 
the leading contribution to the transition probability po^, n (t) arises from the 2nd order amplitude 
(8.152), i.e., it holds 

Po ^n(t) = \(n\^\t)}\ 2 . (8.161) 

To determine the transition rate we proceed again, as we did in the the case of 1st order transitions, 
i.e., in Eqs. (8.153-8.158). We define 



and, similarly, 



z\ 



y. (n\Vj \m) (m|Vj |0) 



m=0 



fvjj\ — ih\ I 

exp[^ (e n - e Q - 2hwi - 2ihX)t] 
e n — e Q — 2huj\ — 2ih\ 



(8.162) 



Z 2 



Z3 



It holds 
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m=0 



(n\V 2 \m) (m\V 2 |0) 
e m - e Q =F ft^2 - ihX 



exp [| (e„ - e D =F 2/iu;2 - 2zftA)i] 



oo 

£ 

m=0 

X 



e n - 6 a T 2hu>2 — 2ih\ 
(n\V 2 \m)(m\Vi\0) {n\V 2 \m){m\Vi |0) 



+ 



\ £m — £o — — iftA e m — e =F ^2 — i^A 
exp (e n — e D — fta>i =F ^2 — 2z/iA)i] 
£n — £o — foji T ^2 — 2z^A 



R + ^2 + ^3 



kl| 2 + |^2| 2 + |^3 1 2 + ^ ZjZfc 



j,k=l 



(8.163) 
(8.164) 

(8.165) 
(8.166) 



In this expression the terms | Zj \ 2 exhibit only a time dependence through a factor e 2Xt whereas the 
terms ZjZ*k for j ^ k have also time-dependent phase factors, e.g., exp[^(±u;2 — Time average 
{ • • • )t of expression (8.166) over many periods of the radiation yields (exp[|(±c<J2 — ^i)])t = 
and, hence, 

(\zi + z 2 + z 3 \ 2 ) t = \z x \ 2 + \z 2 \ 2 + |z 3 | 2 



(8.167) 



Taking now the limit limA^o+ an d using (8.157) yields, in analogy to (8.158), 



2tt 
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m=0 



{n\Vi \m) (m\Vi |0) 
e m — e G — feji — i/iA 



S(e m - e - 2hwi) 



(8.168) 



absorption of 2 photons hoj\ 
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g (n\V 2 \m) (m\V 2 |0> 



m=0 



e Q =F Hlu 2 — 



S(e m ~ e G T 2^w 2 ) 



absorption/emission of 2 photons ^2 
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m=0 



{n\V 2 \m){m\Vi |0) 

— £o — fa^l — ih\ 
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{n\V 2 \m)(m\Vi |0) 

:m - Co T ^2 - 



<^(e n - e G - ^i T ^2) 



absorption of a photon Tioji and absorption/emission of a photon Hu 2 



This transition rate is to be interpreted as follows. The first term, according to its (5-function factor, 
describes processes which lead to final states |n) with energy e n = e Q + 2Huji and, accordingly, 
describe the absorption of two photons, each of energy huj\. Similarly, the second term describes 
the processes leading to final states |n) with energy e n = e Q ± 2f)w 2 and, accordingly, describe 
the absorption/emission of two photons, each of energy hjj 2 . Similarly, the third term describes 
processes in which a photon of energy hu\ is absorbed and a second photon of energy hw 2 is 
absorbed/emitted. The factors | ••• | 2 in (8.168) describe the time sequence of the two photon 
absorption/ emission processes. In case of the first term in (8.168) the interpretation is 

00 ^ 

V (n\Vi\m) — (m|Vi|0) (8.169) 

^ * v ' e m - to- nuji - ih\ v v ' 

T)6I*t \Tl) ^ TTi/ T)GI"t 777^) ^ 0/ 

virtually occupied state \m) 



, i.e., the system is perturbed through absorption of a photon with energy htoi from the initial 
state |0) into a state \m); this state is only virtually excited, i.e., there is no energy conservation 
necessary (in general, e m 7^ e c + hui) and the evolution of state \m) is described by a factor 
l/( e m — e G — fiw\— ihX); a second perturbation, through absorption of a photon, promotes the system 
then to the state |n), which is stationary and energy is conserved, i.e., it must hold e n = e Q + 2hu}\. 
The expression sums over all possible virtually occupied states \m) and takes the absolute value of 
this sum, i.e., interference between the contributions from all intermediate states \m) can arise. The 
remaining two contributions in (8.168) describe similar histories of the excitation process. Most 
remarkably, the third term in (8.168) describes two intermediate histories, namely absorption/ 
emission first of photon Huj 2 and then absorption of photon huj\ and, vice versa, first absorption of 
photon Hloi and then absorption/ emission of photon ftw 2 . 



8.7 One-Photon Absorption and Emission in Atoms 

We finally can apply the results derived to describe transition processes which involve the absorption 
or emission of a single photon. For this purpose we will employ the transition rate as given in 
Eq. (8.158) which accounts for such transitions. 
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Absorption of a Plane Polarized Wave 

We consider first the case of absorption of a monochromatic, plane polarized wave described through 
the complex vector potential 



A(r,t) 



8nMh 
loV 



u exp 



h 



(k ■ f — cot) 



(8.170) 



We will employ only the real part of this potential, i.e., the vector potential actually assumed is 
A(r,t) = 



u exp 



in 



(k ■ r — ut) 



+ 



u exp 



h 



(—k ■ f + oot) 



The perturbation on an atomic electron system is then according to (8.143, 8.144) 



V S 



V\ exp(—iu>t) + Vz exp(+itot) 



e xt , A 0+ , t 



-oo 



where 



V h2 = 



m 



e _ 27rNh . ±l i 



loV 



p ■ u e 



(8.171) 



(8.172) 



(8.173) 



Only the first term of (8.143) will contribute to the absorption process, the second term can be 
discounted in case of absorption. The absorption rate, according to (8.158), is then 



k. 



abs 



2tt e 2 ItiNK 
h ml ojV 



u ■ (n\pe lh ' r |0) 5(e n - e — huj) 



Dipole Approximation We seek to evaluate the matrix element 

M = (n\£e i% - r \0) . 



(8.174) 



(8.175) 



The matrix element involves a spatial integral over the electronic wave functions associated with 
states \n) and |0). For example, in case of a radiative transition from the Is state of hydrogen to 
one of its three 2p states, the wave functions are (n, £, m denote the relevant quantum numbers) 



i/) n =i,e=o,m=o(r,0,(f>) 

1pn=2,e=l,m(r, 9, 4>) 



2 1 h e " Ky oo(M) Is 



1 / 6 r 

al a 



e- rt2a °Yi m (Q^) 2p 



(8.176) 
(8.177) 



and the integral is 

M = 



^ / r 2 dr [ dcosO [ d4>re~ r ^ a ° Y^ m (9, 

o JO J-l JO 

xVe^e- r / a °Y 00 (6,(P) 



(8.178) 



These wave functions make significant contributions to this integral only for r-values in the range 
r < 10a o . However, in this range one can expand 



ik-r ^ 



1 + ik • r + 



.179) 
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One can estimate that the absolute magnitude of the second term in (8.179) and other terms are 
never larger than 207ra o /A. Using \k\ = 2ir/\, the value of the wave length for the Is — > 2p 
transition 

A = /J^ = 1216 A (8.180) 

and a = 0.529 A one concludes that in the significant integration range in (8.178) holds e ~ 
1 + O(^j) such that one can approximate 

e ilr « 1 . (8.181) 
One refers to this approximation as the dipole approximation. 

Transition Dipole Moment A further simplification of the matrix element (8.175) can then be 
achieved and the differential operator p = j\7 replaced by by the simpler multiplicative operator 
r. This simplification results from the identity 

m 

p = ^[r,H ] (8.182) 
where H a is the Hamiltonian given by (8.104) and, in case of the hydrogen atom, is 

H Q = g£ + V(r) ,V(f)=-^. (8.183) 
For the commutator in (8.182) one finds 

[f,H ] = [f, £-] + [r,V(r]] 

= 

1 3 1 3 

e k=l e k=l 

Using f = Yl"j=i x j&j an< ^ the commutation property [x k , pj] = ihdkj one obtains 

H 3 H 3 ft 

[f, H ] = — Pk ej5 jk = — V] p k e k = — p (8.185) 

j,k=l j,k=l 

from which follows (8.182). 

We are now in a position to obtain an alternative expression for the matrix element (8.175). Using 
(8.181) and (8.182) one obtains 

M « | (n| [r, H ] |0) = m{e ° m €w) (n\r |0) . (8.186) 



Insertion into (8.174) yields 



4tt 2 e 2 AA 



V 



2 



u-(n\f\0) 5{e n -e -hw) (8.187) 
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where we used the fact that due to the 5-function factor in (8.174) one can replace e n — e Q by Huj. 
The 5-function appearing in this expression, in practical situations, will actually be replaced by a 
distribution function which reflects (1) the finite life time of the states |n), |0), and (2) the fact 
that strictly monochromatic radiation cannot be prepared such that any radiation source provides 
radiation with a frequency distribution. 



Absorption of Thermal Radiation 

We want to assume now that the hydrogen atom is placed in an evironment which is sufficiently hot, 
i.e., a very hot oven, such that the thermal radiation present supplies a continuum of frequencies, 
directions, and all polarizations of the radiation. We have demonstrated in our derivation of the 
rate of one-photon processes (8.158) above that in first order the contributions of all components of 
the radiation field add. We can, hence, obtain the transition rate in the present case by adding the 
individual transition rates of all planar waves present in the oven. Instead of adding the components 
of all possible k values we integrate over all k using the following rule 

EE - v/m/^E <«»»> 

k u u 

Here J dk is the integral over all orientations of k. Integrating and summing accordingly over all 
contributions as given by (8.187) and using kc = uj results in the total absorption rate 



u 



where the factor 1/K arose from the integral over the ^-function. 

In order to carry out the integral J dk we note that u describes the possible polarizations of the 
planar waves as defined in (8.31-8.35). k and u, according to (8.33) are orthogonal to each other. 
As a result, there are ony two linearly independent directions of u possible, say u\ and U2- The unit 
vectors ui, U2 and k can be chosen to point along the x±, X2, X3-axes of a right-handed cartesian 
coordinate system. Let us assume that the wave functions describing states |n) and |0) have been 
chosen real such that p = (n|r|0) is a real, three-dimensional vector. The direction of this vector 
in the ui, 112, k frame is described by the angles tp, the direction of u\ is described by the 
angles #i = iv/2,<pi = and of v,2 by $2 = 7>"/2,y?2 = tt/2. For the two angles a = Z(ui,p) and 
(3 = Z(u2,p) holds then 

cosa = cosi?i cosi? + sin^i sin^cos(99i — 99) = sin-^cos^ (8.190) 

and 

cos/3 = cos-i?2 cos-i? + sini?2 sini?cos(</?2 — = suidsinip . (8.191) 
Accordingly, one can express 

\ u ■ (n|r|0)| 2 = |p| 2 (cos 2 a + cos 2 /3) = sin 2 # . (8.192) 



and obtain 



J die I H^l°) 2 = \P\ 2 J q J dcostf(l - cos 2 ^) = y (8.193) 
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This geometrical average, finally, can be inserted into (8.189) to yield the total absorption rate 

^ab° = 3 ^ | (n|f|0) | 2 , M w photons before absorption. (8.194) 

For absorption processes involving the electronic degrees of freedom of atoms and molecules this 
radiation rate is typicaly of the order of 10 9 s _1 . For practical evaluations we provide an expression 
which eliminates the physical constants and allows one to determine numerical values readily. For 
this purpose we use uj/c = 2ir/\ and obtain 

4e2 " 3 32 ^ e2 ^ 1.37xl0 19 ix^ (8.195) 



and 



3c 3 h 3 a h\ 3 ' s A 3 



(8.196) 



where 

2-Kch 

A = (8.197) 

The last two factors in (8.194) combined are typically somewhat smaller than (1 A/1000 A) 3 = 
10~ 9 . Accordingly, the absorption rate is of the order of 10 9 s^ 1 or 1/nanosecond. 



Transition Dipole Moment The expression (8.194) for the absorption rate shows that the 
essential property of a molecule which determines the absorption rate is the so-called transition 
dipole moment |(n| r |0)|. The transition dipole moment can vanish for many transitions between 
stationary states of a quantum system, in particular, for atoms or symmetric molecules. The 
value of |(n| f |0)| determines the strength of an optical transition. The most intensely absorbing 
molecules are long, linear molecules. 



Emission of Radiation 

We now consider the rate of emission of a photon. The radiation field is described, as for the 
absorption process, by planar waves with vector potential (8.171) and perturbation (8.172, 8.173). 
In case of emission only the second term Vi exp(+iwt) in (8.173) contributes. Otherwise, the 
calculation of the emission rate proceeds as in the case of absorption. However, the resulting total 
rate of emission bears a different dependence on the number of photons present in the environment. 
This difference between emission and absorption is due to the quantum nature of the radiation field. 
The quantum nature of radiation manifests itself in that the number of photons N w msut be an 
integer, i.e., f\f u = 0, 1, 2 , . . .. This poses, however, a problem in case of emission by quantum 
systems in complete darkness, i.e., for J\f w = 0. In case of a classical radiation field one would 
expect that emission cannot occur. However, a quantum mechanical treatment of the radiation field 
leads to a total emission rate which is proportional to J\f u + 1 where N w is the number of photons 
before emission. This dependence predicts, in agreement with observations, that emission occurs 
even if no photon is present in the environment. The corresponding process is termed spontaneous 
emission. However, there is also a contribution to the emission rate which is proportional to N u 
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which is termed induced emission since it can be induced through radiation provided, e.g., in lasers. 
The total rate of emission, accordingly, is 

4e 2 uj 3 

1;^'' = | (n|f|0) | 2 (spontaneous emission) 

4e 2 uj 3 

+ Mu ~z- | (n|f|0) | (induced emission) 



3c 3 /j 

4 

~3c 3 h 



= (-^ + 1)^^ IHr10)| 2 (8.198) 

Nu photons before emission. (8.199) 



Planck's Radiation Law 

The postulate of the M w + 1 dependence of the rate of emission as given in (8.198) is consistent 
with Planck's radiation law which reflects the (boson) quantum nature of the radiation field. To 
demonstrate this property we apply the transition rates (8.195) and (8.198) to determine the 
stationary distribution of photons Hlo in an oven of temperature T. Let N Q and N n denote the 
number of atoms in state |0) and |n), respectively. For these numbers holds 

N n I N = exp[-(e n - e )/k B T] (8.200) 

where k B is the Boltzmann constant. We assume € n — e — Huj. Under stationary conditions 
the number of hydrogen atoms undergoing an absorption process |0) — > \n) must be the same as 
the number of atoms undergoing an emission process |n) — ► |0). Defining the rate of spontaneous 
emission 

ks P = | <n|f|0> | 2 (8.201) 

the rates of absorption and emission are M^kgp and {N u + l)k sp , respectively. The number of atoms 
undergoing absorption in unit time are ftfuk sp N and undergoing emission are (ftfu+l)k sp N n . Hence, 
it must hold 

k sp N = (Afv + 1) k sp N n (8.202) 



It follows, using (8.200), 
This equation yields 



exp[-huj/k B T} = u . (8.203) 
■Nu) + 1 



i.e., the well-known Planck radiation formula. 



8.8 Two-Photon Processes 

In many important processes induced by interactions between radiation and matter two or more 
photons participate. Examples are radiative transitions in which two photons are absorbed or 
emitted or scattering of radiation by matter in which a photon is aborbed and another re-emitted. 
In the following we discuss several examples. 
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Two-Photon Absorption 

The interaction of electrons with radiation, under ordinary circumstances, induce single photon 
absorption processes as described by the transition rate Eq. (8.187). The transition requires that the 
transition dipole moment (n| r |0) does not vanish for two states |0) and \n). However, a transition 
between the states |0) and |n) may be possible, even if (n|r|0) vanishes, but then requires the 
absorption of two photons. In this case one needs to choose the energy of the photons to obey 



= e. 



+ 2hcu . 



(8.205) 



The respective radiative transition is of 2nd order as described by the transition rate (8.168) where 
the first term describes the relevant contribution. The resulting rate of the transition depends on 
J\f%. The intense radiation fields of lasers allow one to increase transition rates to levels which can 
readily be observed in the laboratory. 

The perturbation which accounts for the coupling of the electronic system and the radiation field is 
the same as in case of 1st order absorption processes and given by (8.172, 8.173); however, in case 
of absorption only V\ contributes. One obtains, dropping the index 1 characterizing the radiation, 



2vr 



e 2 
_ 



m 



E 



(n\u ■ p e %k ' r \m) (m\u ■ p e' lk ' r |0) 



m=0 

x 5(e m - e Q - 2hw) . 



e m — e Q — hui — ihX 



(8.206) 



Employing the dipole approximation (8.181) and using (8.182) yields, finally, 



V 



2 8vr 3 e 4 



h 



E 



(en - e m ) u ■ (n\f\m) (e m -e )u- (m\f\0) 



m=0 

x 8(e m - e - 2hw) . 



hw (e m — e Q — Huj — ih\ ] 



(8.207) 



Expression (8.207) for the rate of 2-photon transitions shows that the transition |0) — > |n) becomes 
possible through intermediate states \m) which become virtually excited through absorption of a 
single photon. In applying (8.207) one is, however, faced with the dilemma of having to sum over 
all intermediate states \m) of the system. If the sum in (8.207) does not converge rapidly, which is 
not necessarily the case, then expression (8.207) does not provide a suitable avenue of computing 
the rates of 2-photon transitions. 



Scattering of Photons at Electrons — Kramers-Heisenberg Cross Section 

We consider in the following the scattering of a photon at an electron governed by the Hamiltonian 
H a as given in (8.104) with stationary states \n) defined through (8.106). We assume that a planar 
wave with wave vector k\ and polarization u±, as described through the vector potential 

A(r, t) = A ol fix cos(A?i • f - Wl t) , (8.208) 

has been prepared. The electron absorbs the radiation and emits immediately a second photon. 
We wish to describe an observation in which a detector is placed at a solid angle element dO,2 = 
sin#2 d02 d(p2 with respect to the origin of the coordinate system in which the electron is described. 
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We assume that the experimental set-up also includes a polarizer which selects only radiation with 
a certain polarization u 2 . Let us assume for the present that the emitted photon has a wave vector 
k 2 with cartesian components 

(sin#2 cos <fo \ 
sin<9 2 sin0 2 (8.209) 
cos6> 2 / 

where the value of k 2 has been fixed; however, later we will allow the quantum system to select 
appropriate values. The vector potential describing the emitted plane wave is then 

A(r, t) = A o2 u 2 cos(A? 2 ■ f - c^t) . (8.210) 

The vector potential which describes both incoming wave and outgoing wave is a superposition of 
the potentials in (8.208, 8.210). We know already from our description in Section 8.6 above that 
the absorption of the radiation in (8.208) and the emission of the radiation in (8.210) is accounted 
for by the following contributions of (8.208, 8.210) 

A(f,t) = A^uiexp[i(ki -f - oJit)} + A~ 2 u 2 exp[z (k 2 ■ f - uj 2 t) ] . (8.211) 

The first term describes the absorption of a photon and, hence, the amplitude is given by 



^ - fW < 8 - 2i2 > 

where A/i/V is the density of photons for the wave described by (8.208), i.e., the wave characterized 
through ki, u\. The second term in (8.211) accounts for the emitted wave and, according to the 
description of emission processes on page 229, the amplititude A~ 2 defined in (8.211) is 



A _ _ (g 213) 

where A/2/V is the density of photons characterized through k 2 ,U2- 

The perturbation which arises due to the vector potential (8.211) is stated in Eq. (8.105). In the 
present case we consider only scattering processes which absorb radiation corresponding to the 
vector potential (8.208) and emit radiation corresponding to the vector potential (8.210). The 
relevant terms of the perturbation (8.105) using the vector potantial (8.211) are given by 

Vs(t) = • l^uiexp^fei • r- LOit)} + A~ 2 u 2 exp[-i(k 2 ■ f- u 2 i)]} 

N — v ' 

contributes in 2nd order 

e 2 -* 
+ —A+ oX A o2 u x ■ n 2 exp{i[(A; 1 - k 2 ) ■ r - (ui - to 2 )t}} (8.214) 

V v ' 

contributes in 1st order 



The effect of the perturbation on the state of the electronic system is as stated in the perturbation 
expansion (8.141). This expansion yields, in the present case, for the components of the wave 



8.8: Two-Photon Processes 



233 



function accounting for absorption and re-emission of a photon 
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(8.215) 



r t ft' 

X dt' dt" ^(^-^m-^i+ifl^t' ' e i(e m -e +frw2+ih\)t" 

J t J t 

+ U2 • {n\ p\m) u\ ■ (m\ p\0) x 
r t' 

dt' / dt" e^i^m+^a+jTiA)*' ' e t(e m -e -huii+ih\)t" 

to J t 

We have adopted the dipole approximation (8.181) in stating this result. 

Only the second (1st order) and the third (2nd order) terms in (8.215) correspond to scattering 
processes in which the radiation field 'looses' a photon Huji and 'gains' a photon hu)2- Hence, only 
these two terms contribute to the scattering amplitude. Following closely the procedures adopted in 
evaluating the rates of 1st order and 2nd order radiative transitions on page 222-225, i.e., evaluating 
the time integrals in (8.215) and taking the limits lim^-oo and lim A ^ + yields the transition rate 



k = 



2tt 



${e n - e - fvjji + hiV2) 



~^2 A tl A o2 «1 -«2 HO) 



(8.216) 
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e Q + hw2 



+ 



ftwi 



We now note that the quantum system has the freedom to interact with any component of the 
radiation field to produce the emitted photon hio 2 - Accordingly, one needs to integrate the rate as 
given by (8.216) over all available modes of the field, i.e., one needs to carry out the integration 
V(27r)~ 3 J k\dk,2 ■ ■ ■■ Inserting also the values (8.212, 8.213) for the amplitudes and A~ 2 results 
in the Kramers-Heisenberg formula for the scattering rate 
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(8.217) 
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Here r Q denotes the classical electron radius 

„2 



Tn = 



= 2.8 • l(T 15 m . 



m e c z 



(8.218) 
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The factor M\cjV can be interpreted as the flux of incoming photons. Accordingly, one can relate 
(8.217) to the scattering cross section defined through 

rate of photons arriving in the the solid angle element d£l 2 

da = — . — (8.219) 

flux of incoming photons 



It holds then 

~ I tin 

ui • u 2 (n|0) (8.220) 



da = r 2 — {M 2 + 1) dn 2 

Ll>i 



{n\u\-p \m){m\u 2 -p |0) (n\u 2 -p \m)(m\ui-p |0) 



me m V e m - e Q + fuv 2 e m - e Q - hu x 

In the following we want to consider various applications of this formula. 

Rayleigh Scattering 

We turn first to an example of so-called elastic scattering, i.e., a process in which the electronic 
state remains unaltered after the scattering. Rayleigh scattering is defined as the limit in which the 
wave length of the scattered radiation is so long that none of the quantum states of the electronic 
system can be excited; in fact, one assumes the even stronger condition 

hw\ « \e Q — e m \ , for all states \m) of the electronic system (8.221) 

Using |n) = |0) and, consequently, u\ = u 2 , it follows 

da = rl(M 2 + l)dn 2 |{ti • u 2 - S{hu)\ 2 (8.222) 

where 

JL_ y^/ (0|fli-p|m)(m|M 2 -p|0) + (0\u2-p |m)(m|Mrp|0) \ (8 223) 

Condition (8.221) suggests to expand S(Huj) 



Using 



S(Hu) = S{0) + S'(0)hw + ^S"{0){hu) 2 + ... (8.224) 

1 1 Hu (hu) 2 

+ — + ... (8.225) 



Cm Co i hio e m e G (e m ^o) 2 (Cm Co) 
one can readily determine 
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me (e m - e G ) 3 m e (e m - e ) 3 



(8.226) 



(8.227) 



(8.228) 
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These three expressions can be simplified using the expression (8.182) for p and the expression 
(8.108) for the identity operator. 

We want to simplify first (8.226). For this purpose we replace p using (8.182) 

(0\ui-p\m) 1 , , „ .. . (m\u\-p\Q) 1 , 

(0|u r r|m) , x 1 F 1 ' = - — (m|ui-r |0) (8.229) 



m e (e m - e Q ) ih ' m e (e m - e Q ) ih 

This transforms (8.226) into 

SYO) = 4y^( <0|«i-f|m)(m|«2-^|0) - (0|« 2 -^|m)(m|«i-r 10) ) (8.230) 
in ^— ' 

m 

According to (8.108) this is 

5(0) = (0|u r f u 2 -f - u-i-tfux-r |0) . (8.231) 
The commutator property [xj, p^] = ^<5jfc yields finally 

^3 3 

= ^ 2^ C-"i)j («*2)*r <0|[arj, p fc ]|0> = £ («iM«2) fc = «i-«2 (8.232) 
j,fe=i j,fe=i 

Obviously, this term cancels the u\-u 2 term in (8.222). 

We want to prove now that expression (8.227) vanishes. For this purpose we apply (8.229) both to 
u\ • p and to u 2 • p which results in 

S'(0) = ^^((0|n 2 -f|m)(m|n r r|0) - (Q\u r r\m)(rn\u 2 -r |0) ) . (8.233) 

m 

Employing again (8.108) yields 

777 

S'(0) = (0| [n 2 -r, u r r] |0) = (8.234) 
where we used for the second identity the fact that U\-f and -u 2 -r commute. 

S"(0) given in (8.228) provides then the first non-vanishing contribution to the scattering cross 
section (8.222). Using again (8.229) both for the u\ -p and the to u 2 -p terms in (8.228) we obtain 

S"(Q) = 2me J^f ^°l^ rf \ m )( m \u2-r |0) + (0\u 2 -f\m)(m\u 1 -f\0) \ (8 235) 

We can now combine eqs. (8.224, 8.232, 8.234, 8.235) and obtain the leading contribution to the 
expression (8.222) of the cross section for Rayleigh scattering 

da = rlm 2 e to 4 (M 2 + l)dn 2 x (8.236) 

2 

(0|u*-r \m)(m\u 2 -r |0) : (Oju^-r \m)(rn\u\-r |0) 



/ \U| U 1 -7 | U 2 • / |U/ ^ 



We have applied here a modification which arises in case of complex polarization vectors u which 
describe circular and elliptical polarizaed light. 

Expression (8.236) is of great practical importance. It explains, for example, the blue color of 
the sky and the polarization pattern in the sky which serves many animals, i.e., honey bees, as a 
compass. 
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Thomson Scattering 

We consider again elastic scattering., i.e., |n) = |0) and u>\ = u 2 = uj in (8.220), however, assume 
now that the scattered radiation has very short wave length such that 

u) » \e Q — e m \ , for all states \m) of the electronic system . (8.237) 

The resulting scattering is called Thomson scattering. We want to assume, though, that the dipole 
approximation is still valid which restricts the applicability of the following derivation to 

k x » — , a Bohr radius . (8.238) 
a 

One obtains immediately from (8.220) 

da = r 2 Q [M 2 + 1) dQ 2 \ui • u 2 \ 2 . (8.239) 

We will show below that this expression decribes the non-relativistic limit of Compton scattering. 
To evaluate \u± • u 2 \ 2 we assume that k\ is oriented along the X3-axis and, hence, the emitted 
radiation is decribed by the wave vector 

(sin#2 cos<f> 2 \ 
sin9 2 sin^> 2 (8.240) 
cos6> 2 J 

We choose for the polarization of the incoming radiation the directions along the x\- and the X2-axes 

u? = (Vj , «< 2 > = ( J j (8-241) 

Similarly, we choose for the polarization of the emitted radiation two perpendicular directions 
and u 2 which are also orthogonal to the direction of k 2 . The first choice is 



,(i) k 2 x ki 

q = :5- 

\k 2 x k\\ 




(8.242) 



where the second identity follows readily from k\ = £3 and from (8.240). Since needs to be 
orthogonal to k 2 as well as to the sole choice is 

r . (1) / COS0 2 COS02 

4 2 ) = J x u 2 = cos 2S - m ^ 2 ) (8 .243) 

\k 2 xu ( 2 '\ y _ sin Q 2 

The resulting scattering cross sections for the various choices of polarizations are 



da = r 2 Q (Af 2 + 1) dn 2 x < 



sm 2 4> 2 


for 


Ul 


= «?> 


u 2 


= 4 ] 


COS 2 #2 COS 2 02 


for 


Ul 


= «?> 


u 2 


= 4 2) 


COS 2 (/>2 


for 


Ul 


= u? 


u 2 




cos 2 ^2 sm 2 (f> 2 


for 


Ul 


= u? 


u 2 


= 4 2) 



(8.244) 
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In case that the incident radiation is not polarized the cross section needs to be averaged over 
the two polarization d 
unpolarized radiation 



the two polarization directions and One obtains then for the scattering cross section of 



da = r 2 (H 2 + 1) d£l 2 x < 



I for uo = ft?' 

* (8.245) 

| cos z t^2 for U 2 = U 2 



The result implies that even though the incident radiation is unpolarized, the scattered radiation 
is polarized to some degree. The radiation scattered at right angles is even completely polarized 
along the n^-direction. 

In case that one measures the scattered radiation irrespective of its polarization, the resulting 
scattering cross section is 

2 

^tot = y {M 2 + 1) ( 1 + cos 2 6> 2 ) dn 2 . (8.246) 

This expression is the non-relativistic limit of the cross section of Compton scattering. The Comp- 
ton scattering cross section which is derived from a model which treats photons and electrons as 
colliding relativistic particles is 

,2 /, , \ 2 



where 



^o? = % (A/-2 + 1) ( ^) ( ^ + ^ - *n*<h) dn 2 (8.247) 
uj 2 x - uj^ 1 = —^(1 - cos<9 2 ) (8.248) 



m P c 2 



One can readily show that in the non-relativistic limit, i.e., for c — > oo the Compton scattering 
cross section (8.247, 8.247) becomes identical with the Thomson scattering cross section (8.246). 

Raman Scattering and Brillouin Scattering 

We now consider ineleastic scattering described by the Kramers-Heisenberg formula. In the case 
of such scattering an electron system absorbs and re-emits radiation without ending up in the 
initial state. The energy deficit is used to excite the system. The excitation can be electronic, 
but most often involves other degrees of freedom. For electronic systems in molecules or crystals 
the degrees of freedom excited are nuclear motions, i.e., molecular vibrations or crystal vibrational 
modes. Such scattering is called Raman scattering. If energy is absorbed by the system, one speaks 
of Stokes scattering, if energy is released, one speaks of anti-Stokes scattering. In case that the 
nuclear degrees of freedom excited absorb very little energy, as in the case of excitations of accustical 
modes of crystals, or in case of translational motion of molecules in liquids, the scattering is termed 
Brillouin scattering. 

In the case that the scattering excites other than electronic degrees of freedom, the states |n) etc. 
defined in (8.220) represent actually electronic as well as nuclear motions, e.g., in case of a diatomic 
molecule \n) = \(f)(elect.) n , 4>(vibr.) n ). Since the scattering is inelastic, the first term in (8.220) 
vanishes and one obtains in case of Raman scattering 

da = r 2 (M 2 + 1) — dQ 2 | u 2 • R • ui | 2 (8.249) 
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where R represents a 3 x 3-matrix with elements 



Rjk 




(n| pj \m)(m\ p^ |0) 

e m - e D + hu>2 
- e Q )/h 



+ 



(n\ pk \m)(m\ pj |0) 

Cm — Co — fi^l 



(8.250) 



(8.251) 



We define x • R • y = J2j,k x j R jk Vk- 

In case that the incoming photon energy hu>i is chosen to match one of the electronic excitations, 
e.g., Hwi f» e m — e for a particular state |m), the Raman scattering cross section will be much 
enhanced, a case called resonant Raman scattering. Of course, no singlularity developes in such 
case due to the finite life time of the state \m). Nevertheless, the cross section for resonant Raman 
scattering can be several orders of magnitude larger than that of ordinary Raman scattering, a 
property which can be exploited to selectively probe suitable molecules of low concentration in 
bulk matter. 



Chapter 9 

Many— Particle Systems 



In this chapter we develop the quantummechanical description of non-relativistic many-particle- 
systems. Systems to which this chapter applies appear in many disguises, as electrons in crystals, 
molecules and atoms, as photons in the electromagmetic field, as vibrations and combination of 
electrons and phonons in crystals, as protons and neutrons in nuclei and, finally, as quarks in 
mesons and baryons. The latter systems require, however, relativistic descriptions. 
The interaction among many identical particles gives rise to a host of fascinating phenomena, e.g., 
collective excitations in nuclei, atoms and molecules, superconductivity, the quantum Hall effect. 
A description of the mentioned phenomena requires a sufficient account of the interactions among 
the particles and of the associated many-particle motions. In the following we will introduce the 
rudimentary tools, mainly those tools which are connected with effective single-particle descrip- 
tions. 

9.1 Permutation Symmetry of Bosons and Fermions 

We seek to determine the stationary states of systems of many identical particles as described 
through the stationary Schrodinger equation 



Here \^f) represents the state of the system which in the following will be considered in a repre- 
sentation defined by space-spin-coordinates Xj = (fj,aj). fj and Uj denote position and spin, 
respectively, of the j-th particle. The spin variable for electrons, for example, is aj = ±5. We will 
assume systems composed of N particles, i.e., the particle index j runs from 1 to N. The wave 
function in the space-spin representation is then 



Permutations 

The essential aspect of the systems described is that the particles are assumed to be identical. This 
has the consequence that one cannot distinguish between states which differ only in a permutation 
of particles. In fact, quantum theory dictates that all such states are identical, i.e., are not counted 



H\t>) = E\V) . 



(9.1) 



*(xi,x 2 , . . -x N ) = (xi,x 2 , • • -XjV I*) . 



(9.2) 
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repeatedly like degenerate states. However, not any state can be chosen as to represent the many- 
particle system, rather only states which obey a certain transformation property under permutations 
are acceptable. This property will be established now. In the following sections we will then 
introduce the operators of second quantization which provide a means to ascertain that manipulation 
of wave functions always leave the transformation property under permutations uncompromised. 
The permutations we need to consider for a system of iV particles are the elements of the group Sn, 
the set of all permutations of N objects. The elements P £ Sn can be represented conveniently 
as 2 x N matrices, the top row representing the numbers 1 to TV labelling the N particles under 
consideration, and the second row showing the numbers 1 to N again, but in a different order, 
number k under the entry j of the first row indicating that particle j is switched with particle k. 
Examples for an eight particle system are 

/ 1 2 3 4 5 6 7 8 \ / 1 2 3 4 5 6 7 8 \ 

1 V 12456387 / v 12354678 / 

P2 which affects only two particles, i.e., particles 4 and 5, leaving all other particles unchanged, is 
called a transposition. Transpositions denoted by T(j, k) are characterized by the two indices (j, k) 
of the two particles transposed, i.e., (j, k) = (4, 5) in case of P2. We can then state P2 = T(4, 5). 
We will not discuss here at any depth the properties of the permutation groups Sn, even though 
these properties, in particular, the representations in the space of iV-particle wave functions are 
extremely useful in dealing with iV-particle systems. A reader interested in the quantum mechanical 
description of N particle systems is strongly encouraged to study these representations. For the 
following we will require only two properties of Sn, namely implicitly the group property, and 
explicitly the fact that any P £ Sn can be given as a product of transpositions T(j, k). The latter 
factorization has the essential property that the number of factors is either even or odd, i.e., a 
given P can either only be presented by even numbers of transpositions or by odd numbers of 
transpositions. One calls permutations of the first type even permutations and permutations of the 
second type odd permutations. The group Sn is then composed of two disjunct classes of even and 
of odd permutations. For the permutations P\ and P2 given above holds 

Pi = T(3,4)T(4,5)T(5,6)T(6,3)T(7,8) ; P 2 = T(4,5) (9.4) 

both permutations being obviously odd. The product P1P2 of even and odd permutations P\ and 
P2 obey the following multiplication table: 



P2 \ Pi 


even odd 


even 
odd 


even odd 
odd even 



We finally determine the action of permutations P on the wave functions (9.2). Denoting by P(j) 
the image of j, i.e., in notation (9.3) the j-th index in the second row, one can state 



PiB(xi,X2,...,Xj,...,X N ) = *(sp(l),fp( 2 ), ....... ,X P (j), ... ,fp(JV)) • (9.5) 
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Permutation Symmetry 

A second consequence of the identical character of the particles under consideration is that the 
Hamiltonian must reflect the permutation symmetry. In fact, the Hamiltonian must be of the type 

N N 

H = J>0%) + g E G ^k) (9.6) 

j=l j,k=l 

which describes one-particle interactions and two-particle interactions of the system. G{xj,Xk) is 
symmetric in its two arguments. The terms of the Hamiltonian will be discussed further below. 
Presently, it is essential that the functions F{ ) and G{ , ) are the same for all particles and for all 
pairs of particles, all particles being governed by the same interactions. From this property follows 
that H is independent of any specific assignment of indices to the particles (note that (9.5) implies 
that a permutation effectively changes the indices of the particles) and, hence, the Hamiltonian for 
permuted indices PHP~ l is identical to H. From the latter results 

[H,P] = 0. (9.7) 

This property, in turn, implies that the permutation operators can be chosen in a diagonal repre- 
sentation. In fact, it is a postulate of quantum mechanics that for any description of many-particle 
systems the permutation operators must be diagonal. Since permutations do not necessarily com- 
mute, a diagonal representation can only berealized in a simpler group. In fact, the representation 
is either isomorphic to the group composed of only the '1', i.e., 1 together with multiplication, or 
the corresponding group of two elements 1, —1. The first case applies to bosons, i.e. particles with 
integer spin values, the second group to fermions, i.e., particles with half-integer spin values. In 
the latter case all even permutations are represented by '1' and all odd permutations by '-1'. Ob- 
viously, both groups, i.e., {1} (trivial) and {1,-1}, provide a representation of the group structure 
represented by the multiplication table of even and odd permutations given above. 
The boson and fermion property stated implies that for a system of N bosons holds 



P\q?) = VPe S N . (9.8) 

For fermions holds 

P|tf) = ep |*) VP G S N (9.9) 

where 

/ 1 for P even 
€P = { -1 for P odd • (9 ' 10) 

We will construct now wave functions which exhibit the appropriate properties. 



Fock-Space 

Our derivation will start by placing each particle in a set of S orthonormal single particle states 
(x\k) = 4>k(x), k = 1, 2, . . . S where S denotes the number of available single particle states, usually 
a very large number. The single particle states are assumed to be orthonormal, i.e., for the scalar 
product holds 

(j\k) = S jk . (9.11) 
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The scalar product involves spin as well as space coordinates and is explicitly given by 

(j\k) = J $(*=)&(»=) ■ (9.12) 

where the second factor represents the scalar product between spin states. We first consider or- 
thonormal many-particle wave functions, the so-called Fock-states, which do not yet obey the 
symmetries (9.8, 9.9). In a second step we form linear combinations of Fock states with the desired 
symmetries. 

The Fock states represent N particles which are placed into S states | Ai), | A2), . • • , |Ajv), Xj £ 
{1, 2, ... S} of the type (9.11, 9.12), each particle j into a specific state \Xj), i.e., 

N 

(x 1 ,x 2 ,...x N \V Fock (X 1 ,X 2 ,...,X N )) = H^ixj). (9.13) 

i=i 

These states form an orthonormal basis of iV-particle states. It holds 

N 

(M/ Fock (A / 1 ,A 2 ,...,A^)|^ Fock (A 1 ,A 2 ,...,A JV )) = l[Sy jXj . (9.14) 

Obviously, the states |* Fock (Ai, A2, • • • , Ajv)) do not exhibit the symmetries dictated by (9.8, 9.9). 

Wave Functions with Boson Symmetry in the Occupation Number Representation 

Wave functions with the proper symmetries can be obtained as linear combinations of Fock states. 
A wave function which obeys the boson symmetry (9.8) is 

(xi,x 2 , ■ ■ .x N \^ B (n 1 ,n 2 , . . . ,n s )) = (9.15) 

Ep eA (A 1> ...A JV ) P(xi,x 2 ,...x N \* F ° c \X 1 ,X 2 ,... .X N )) 

Here the states (Ai, A2, . . . .Ajv) on the rhs. is a particular choice of placing N particles consistent 
with the occupation numbers (ni, n 2 , . . . , 715). A(Ai, . . . , Ajv) C Sn is the set of all permutations 
involving only particles j with different Xj, e.g., T(j,k) 6 A(Ai, . . . , Ajv) only if Xj 7^ A&. The 
integers rij are equal to the number of A& in (9.15) with A^ = j, i.e. the rij specify how often 
a single particle state \j) is occupied. The numbers rij, referred to as the occupation numbers, 
characterize the wave function (9.15) completely and, therefore, are essential. The reader is well 
advised to pause and grasp the definition of the rij . 

An important detail of the definition (9.15) is that the sum over permutations does not involve 
permutations among indices j, k, . . . with Aj = A& = .... This detail is connected with the fact 
that a two-particle state of the type \& Fock (A, A) in which two particles occupy the same single- 
particle state I A) does not allow transposition of particles; the reason is that such transposition 
duplicates the state which would, hence, appear severalfold in (9.15). 
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The wave functions (9.15) describe, for example, the photons of the electromagnetic field or the 
phonons in a crystal. These states form an orthonormal basis, i.e., for two sets of occupation 
numbers J\f = (m, ri2, . . . , ns) and j\f' = (n' 1; n 2 , ■ ■ . , n' s ) holds 

S 

<* B (AOl* B (A0> = l[6 n . n ,. (9.16) 

3=1 



Exercise 9.1.1: 

(a) Show that the states (9.15) obey the boson symmetry (9.8). 

(b) Show that for the states (9.15) holds (9.16). 



Wave Functions with Fermion Symmetry in the Occupation Number Representation 

One can construct similarly a wave function which obeys the fermion symmetry property (9.9). 
Such wave function is 

(xi,x 2 , ■ ■ .x N \^ F {ni,n 2 , . . . ,n s )) = 

7= E P eS N epP(xi,x 2 ,...x N \* Fock (\i,\2,--. -Ajv)) (9.17) 

Here the states (Ai, A2, . • • .Ajv) on the rhs. correspond to a particular choice of placing N par- 
ticles consistent with the occupation numbers (ni,n2, ■ • . ,ns)- Using the identity governing the 
determinant of N x iV-matrices A 



N 

det(A) = €p H A jPU) 

PeS N j=i 

this wave function can also be expressed through the so-called Slater determinant 

(xi,x 2 , ■ ■ .x N \ty F (m,n 2 , . . . ,ns)} = 



(9.18) 



1 

/M 



(pxA^i) <f>\ 2 (xi) 

0Ai(»2) (PX 2 (X2) 

4>\Axn) 4>\ 2 (x n ) 



4>x n (xi) 
(f>\ N (x N ) 



(9.19) 



Here, the integers nj denote the occupancy of the single-particle state \ j). The important property 
holds that rij can only assume the two values nj = or nj = 1. For any value nj > 1 two or more 
of the columns of the Slater matrix are identical and the wave function vanishes. 
The fermion states (9.17) describe, for example, electrons in an atom, a molecule or a crystal. These 
states form an orthonormal basis, i.e., for two sets of occupation numbers N = (711,712, • . . ,ns) 
and N' = (n[, n 2 , . . . , n' s ) holds 



<* F (AO|* F (A0> = U 6 ^ 



(9.20) 
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The representation of wave functions (9.15) and (9.17) is commonly referred to as the occupation 
number representation since the wave functions are uniquely specified by the set of occupation 
numbers J\f = (m, ri2, ■ ■ ■ , ns)- 



Exercise 9.1.2: 

(a) Show that the states (9.17) obey the fermion symmetry (9.9). 

(b) Show that the states (9.17) obey the orthonormality property (9.20). 



9.2 Operators of 2nd Quantization 

An important tool to describe mathematically systems of many bosons or fermions, guaranteeing 
the correct permutation properties of the many-particle states, are the so-called operators of 2nd 
quantization. These operators allow one to construct and manipulate many-particle wave functions 
while preserving permutation symmetry. 

Creation and Annihilation Operators for Bosons 

We consider the operator aj defined through 

,*V„...,,„..,» s )={f*>' » ! ' *H . (9.21) 



The factor ^Jnj appears here on the rhs. since both the iV-particle wave function 
\I/ B (ni, . . . , rij, . . . , ns) and the N — 1-particle wave function ^ B (ni, . . . , rij — 1, . . . , ns) are normal- 
ized according to (9.15). For the adjoint operator a)- holds 

a]^ B {n 1 ,...,n j ,...,n s ) = ^/nj+l * B (m, . . . , rij + 1, . . . , n s ) . (9.22) 

The operators cij and a)- obey the commutation properties 

[a j} a k ] = 0, [at, 4] = (9.23) 

K,a[] =5 jk . (9.24) 

The operators aj and aj are referred to as the creation and annihilation operators of bosons for the 
orthonormal single-particle states \j). 

To prove that (9.22) follows from (9.21) we consider the matrix Aj corresponding to aj in the basis 
of many-particle states (9.15). Employing the superindices M 1 and J\f introduced above and using 
the orthonormality property (9.16) one obtains 

S 

(AjW = <* B (AOl ^l* B (A0> = V^fyn,-l II 5 <n k ■ ( 9 - 25 ) 

k=l 
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Let Aj be the matrix adjoint to Aj. One obtains for its matrix elements 

Jn'j ;S n n '.-i Ilfe=l S n k n' = \/nj + \ 5 n >. nj+l Y[k=l ^n'n k • (9.26) 

From this result one can immediately conclude that (9.22) follows from (9.21) and vice versa. 
We will prove the commutation properties (9.24), the properties (9.23) follow in an analogous way. 
It holds for j = k 

(a 3 a] - a]aj)MAf)) = {n j + l - nj )\*(Af)) = |*(AT)> . (9.27) 

Since this holds for any \ty(Af)} it follows [ctj,cij] = 1. For j ^ k follows similarly 

JaA - al aj )MAT)) = (9.28) 
{\Jnj{n k + 1) - ^(n k + l)n i )|*(ni, . . . rij - 1 . . . n k + 1 . . .n s )) = . 



The creation operators a\ allow one to construct boson states ^ B (Af)} from the vacuum state 



Equations (9.27, 9.29) yield the commutation relationship (9.24). 

|0) = I^Vx = 0, n 2 = 0, . . . , n s = 0)) (9.29) 

as follows 



|* B ( m , rc 2 , . . . , ns)) = TT ^=- |0) • (9.30) 
Of particular interest is the operator 

^ = a)a 3 . (9.31) 

This operator is diagonal in the occupation number representation, i.e., for basis states 
|^ B (ni,n 2 ,...,n 5 )) = \^ B (M)). One can readily show using (9.21, 9.22) 

Nj |^ B (AA)) = nj |* B (A0> , (9.32) 

i.e., the eigenvalues of Nj are the occupation numbers nj. One refers to Nj as the occupation 
number operator. The related operator 

s 

N = ( 9 - 33 ) 

3=1 

is called the particle number operator since, obviously, 

S 

N\^ B (n 1 ,n 2 ,...,n s )} = ^ |* B (ni, n 2 , . . . , n s )) 

i=i 

= N\^ B { ni ,n 2 ,...,n s )) . (9.34) 
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In using the boson creation and annihilation operators, in general, one needs to apply only the 
commutation propertes (9.23, 9.24) and the property dj |0) = 0. As long as one starts from a wave 
function with the proper boson symmetry, e.g., from the vacuum state |0) or states |^/ B (AT)), one 
does not need to worry ever about proper symmetries of wave functions, since they are induced 
through the algebra of a j and aj . 



Exercise 9.2.1: The commutation relationships (9.23, 9.24) and aj\0) = imply that the the 
properties (9.21, 9.22, 9.23) hold for the state defined through (9.29, 9.30). Prove this by induction, 
showing that the property holds for rij = and, in case it holds for rij, it also holds for nj + 1. 
Exercise 9.2.2: Show that the boson operators a\ and a,- satisfy 



[oj, f(a})} = — f 



[«!■,/(%)] = - 



df(aj) 



V" 1 daj 



where the operator function is assumed to have a convergent Taylor expansion 

oo ^ 

f(A) = £-/< n >(0)A" ( 9 - 35 ) 



n! 

n=0 



and where the derivative operation is defined through 

df(A) 



= E 7^I)T f {n) (0)A^. (9.36) 



OA ^ in- 1) 

71=1 



Creation and Annihilation Operators for Fermions 

We want to consider now creation and annihilation operators for fermions, i.e., operators which can 
alter the occupancy of the wave function (9.17, 9.19) without affecting the fermion symmetry (9.9). 
Before proceeding in this respect we need to account for the following property of the fermion wave 
function which applies in the case rij,nj + i = 1, i.e., in case that the single particle states \j) and 
\j + 1) are both occupied, 

(xi,x 2 , ■ ■ .x N \^ F (ni, . . .rtj,n j+ i . . .n s )) = (9.37) 
- (xi,x 2 , ■ . .XAr|^ F (ni, . . .,n j+1 ,rij . ..ns)} ■ (9.38) 

Obviously, the fermion wave function changes sign when one exchanges the order of the occupancy. 
To prove this property we notice that the l.h.s. of (9.38) corresponds to 

(x\,X2, ■ ■ .x N \fy F (ni, . . .nj,n j+ i . ..ns)) = 



l 

/M 



(f>j(xi) 
4>j(x 2 ) 



bj+i(xi) 
i>j+i{x 2 ) 



(9.39) 



<t)j{x N ) (f>j + l(x N ) 
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The r.h.s. of (9.38) reads 



(xi,x 2 , ■ . .XAr|* F (ni, . . .,n j+1 ,nj . ..n s )) 



l 

/M 



0j+i(£i) 



^•(xi) 

0jO?2) 



4>j+i(x N ) 4>j(x N ) 



(9.40) 



Because of the property of the determinant to change sign when two columns are interchanged, the 
expressions (9.39) and (9.40) are identical. 

Obviously, it is necessary to specify for a fermion wave function the order in which the single- 
particle states \Xj) are occupied. For this purpose one must adhere to a strict convention: the 
labelling of single-particle states by indices j = 1,2,... must be chosen once and for all at the 
beginning of a calculation and these states must be occupied always in the order of increasing 
indices. A proper example is the two particle fermion wave function 



(x±, X2\^ F (ni = 0, ri2 = 1, nz = 0, 124 = 0, ns 
<fo(£i) 05 (£1 

<p2(x 2 ) 4>b(x2 



1 

V2 



l,n 6 = n 7 = . . .ns = 0)) 
= ^ [02(^1)^5(^2) - 05(^1)02(^2)] • 



(9.41) 



Before we consider the definition of fermion creation and annihilation operators we need to take 
notice of the fact that one also needs to define at which position in the wave function, i.e., at 
which column of the Slater determinant (9.19), particles are being created or annihilated. One 
adopts the convention that particles are created and annihilated by these operators always at the 
first position of the wave function, i.e., at the first column of the Slater determinant (9.19). This 
requires one, however, in order to be consistent with this convention, that occupancies are always 
ordered according to a monotonous increase of the single-particle state index, to move the particle 
to the first position (to be annihilated there) or to move it from the first position to its canonical 
position (after it had been created at the first position). This change of position is connected with 
a possible sign change (—1)* where qj is defined for a given N = (ni, n 2 , ■ ■ ■ ns) as follows: 



j'-i 



1i 



(9.42) 



k=i 



i.e., qj is the number of states \k) with k < j which are occupied in a fermion wave function. 
We will be adopting below the notational convention that q'j corresponds to occupancies jV' = 
(n[,n' 2 , . ..n' s ). 

We are now ready to define the annihilation operator Cj for fermions in the single-particle state \ j) 
as follows 



cj |^ F (ni,n 2 , ...nj. ..n s )) = 
nj (-1)* |^ F (ni,n 2 ,n 3 . ..nj-i ,nj -»■ 0,n j+1 . 

v ' 

qj states occupied 



■■n s )) 



(9.43) 



In case that the single particle state \j) is not occupied, i.e., nj = 0, the rhs. vanishes. 
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The operator cj- adjoint to Cj exhibits the following property 

c] |* F (ni,n 2 , ...rij. ..n s )} = 
(-1)*(1 - rij) |* F (m,n 2 ,n 3 . ..rij-!, rij -> l,n j+1 . ..n s )) (9.44) 

V v ' 

qj states occupied 
The operators thus defined obey the commutation properties 

[ Cj ,c k ] + = 0; [ct,4]+ = (9.45) 

[ Cj ,c{} + =S jk (9.46) 

where we have used the definition of the so-called anti- commutators [A,B]+ = AB + BA. 
We first show that (9.44) follows from the definition (9.43). Let Cj be the matrix corresponding to 
the operator Cj in the basis of fermion states (9.17, 9.19). The elements of Cj are then ( note that 
rij only assumes values or 1) 

s 

(C, W = <* F (^')I ^ |* F (A0> = nj (-1)* 8^.^ J] 8 <nk . (9.47) 

k=i 

Let be the matrix adjoint to Cj. One obtains for its matrix elements 



(Cj)^, = <* F (AO|c]|* F (A0) = (C,W 
s 

= n'j(-l) q 'i 8 n . nl ■ ! ]J 5„ fe< 



fc=i 

s 

= (-1)* (l-n,)5 n>3+i n <5 n , nfc . (9.48) 

k=i 

From this follows (9.44). We have used here the definition q 1 - = Y2k<j n 'k a l° n S with q 1 - = qj, 
otherwise, the last factor on the rhs. of (9.48) vanishes. 

We will prove now the anti-commutation property (9.46). The anti-commutation properties (9.45) 
follow in an analogous way and will not be derived here. We consider (9.46) first for the case j = k. 
It holds 

4 Cj |^ F (A0) = c](-l)Vn j \y F (n 1 ...n j ^0...ns)) = 

(_l)*(l_ nj . + l)(_l)* nj .|*F(^)) = n ^ F (A0) (9.49) 

The derivation involves realization of the fact that a non-zero result is obtained only in case rij = 1. 
Similarly one obtains 

Cj C ]\^ F (jV)) = c j (-l)U(l-n j )\^ F (n 1 ...n j ^l...n s )) = 
(-l)^'(n j + l))(-l)*(l-n i )|^ F (AA)) = (l-rij)\* F (jV)) (9.50) 
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(9.49) and (9.50) yield 

( Cj c] + c] Cj ) |v|/ F (.A0> = |* F (A0>. (9.51) 

For j < k one obtains 

Cj -4|* F (A0) = (-l)v +qk n j (l-n k )\V F (n 1 ...n j = 0...n k = l...ns)) 
c l c il^ F (-^)> = (-l)*"^*" 1 ^^ 1 " n k)\^ F (ni ■ ■ ■ nj = . . . n k = 1 . . . n s )) 

and, hence, 

+ 4 c i)i* F (^o> = °- ( 9 - 52 ) 

One can obtain similarly the same relationship in the case j > k. Since (9.51, 9.52) hold for any 
\^ F (Af)} one can conclude (9.46). 
Equation (9.49) shows that the operator 

Nj = c) Cj (9.53) 

is diagonal in the occupation number representation, i.e., in the basis | \]/ F (JV) ) , with eigenvalues 
equal to the occupation numbers rij 

Nj\V F (Af)) = nj \* F (Af)) (9.54) 

Nj, hence, is referred to as the occupation number operator. Correspondingly, 

s 

N = J2 N J ( 9 - 55 ) 

is called the particle number operator. 

It is an interesting exercise to demonstrate that Nj only has eigenvalues or 1, for which purpose 
one needs to envoke only the algebraic (anti-commutation) properties (9.46). The stated property 
follows from the idempotence of Nj which can be derived as follows 

Nj = c]cj C )cj = ct (1 — c)cj)cj = c]cj - c)c)cjCj = c)cj = Nj . (9.56) 

Here we have made use of CjCj = and c^c^ = which follows from nj G {0, 1}. 

We finally note that the creation operators allow one to construct any fermion state \^ F (Af)) 

from the vacuum state |0) defined as above (see (9.29)) 

|* F (...n Al ...n A2 ...n Ajv ...)> = • ■ ■ c[ n |0) . (9.57) 

On the l.h.s. of this equation we meant to indicate only those N occupation numbers n\. which 
are non- vanishing. On the r.h.s. the creation operators must operate in the proper order, i.e., an 
operator Cj must be on the left of an operator c k for j < k. 

In the following we will apply fermion operators cj- and Cj only to electrons which carry spin 2 • We 
will often separate the states \j), coordinates x and index j into a space part and a spin part, i.e., 

j - j, a (a = ±i) ; (x\j) - ^-(r) \^a) . (9.58) 
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9.3 One— and Two— Particle Operators 

Definition 

Operators acting on N particle systems of the type 

N 

p = EM) ( 9 - 59 ) 

are called single-particle operators. The operators f{x) which constitute F are called the associated 
generic single-particle operators 1 . Operators of the type 

1 N 

G = 2 E 9(xj,x k ) (9.60) 

j,k=l 

correspondingly are called two-particle operators. The operators g(x, x') which constitute G are 
called the associated generic two-particle operators. These operators had been introduced already 
in Eq. (9.6) above. The essential aspect of definition (9.59, 9.60) is that the sum over particles in 
(9.59) and over pairs of particles in (9.60) involves generic operators - acting on Xj and on Xj,Xk, 
respectively - which are all identical. An operator 

N 

r = E^-(^) ( 9 - 61 ) 

3=1 

is not a one particle operator as long as the N operators rj(x), j = 1,2, . . . , N are not all identical. 
We seek to determine now how one-particle operators F and two-particle operators G act on many- 
boson and many-fermion states |* B (AT)) and \^ F (Af)), respectively. The action of the operators 
is obviously described through the many-particle state matrix elements 

(■$ B ' F (N')\F\y B ' F (N)) ; (tf B > F (JSO|G|* B,F (A0> . (9.62) 

These matrix elements can be obtained by chosing the operators and many-particle states in the 
position representation, i.e., (9.59, 9.59) and (9.15, 9.17) and integrating over all particle space-spin 
coordinates x\, X2, . . . xn- This procedure is most tedious and becomes essentially impossible in 
the general case that many-particle basis functions are linear combinations of wave functions of the 
type (9.15, 9.17). 

Since the many-particle states are built-up from a basis |r), r = 1, 2, . . . S of single-particle spin- 
orbital states (see page 241) and the operators are specified through the associated generic operators 
/ and g, one expects that the matrix elements can actually be expressed in terms of matrix elements 
involving solely the generic operators and the single-particle states, namely, 

(r\f\s) = J d 3 rr r (r,a r )f(x)^ s (r,a s ) (9.63) 

lr This expression has been specified for the purpose of these notes to distinguish F and / and is not common 
terminology. We adopt a similar expression to distinguish two-particle operators G and g. 
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(r,s\g\t,u) = j drj d r' ip* (r, a r )tp* s (r', a s ) g(x, x') ip t (r, cr t )ipu(r' , a u ) . (9.64) 

We like to note an important symmetry of the matrix element {r, s\g\t, u) which follows directly 
from the definition (9.64) 

(r,s\g\t,u) = (s,r\g\u,t) . (9.65) 

This symmetry will be exploited repeatedly below. Notice, that the symmetry implies that one can 
switch simultaneously the pairs of indices r, s and t, u, one cannot switch the indices of one pair 
individually. 

The aim of the present section is to express the matrix elements of single- and two-particle operators 
in a basis of many-particle states (9.62) in terms of the matrix elements of single-particle states 
(9.63, 9.64). We will show that the boson and fermion creation and annihilation operators serve 
this purpose. 

Examples of One and Two— Particle Operators 

An example for a single-particle operator is the kinetic energy operator 

j=i x j 
Here we adopt the notation for the Laplacian 

^- = — + — + — • (9.67) 
dr 2 dx\ dx\ dx\ 

In this case the generic single-particle operator, according to (9.59), is t(x) = (—h 2 /2m)(d 2 /dr 2 ). 
The matrix elements of this operator in a single-particle basis are 

<ra|tV> = <W / d 3 rcf>* r (7) (-^j ^sif) • (9.68) 
Another single-particle operator is the one-particle density operator 

N 

P{r) = J>(r-rS-)- (9-69) 

3=1 

In this case the generic operator is fix) = 5(r — r). The matrix elements of the generic operator 
in the one-particle basis are 

(ra\f\sa'} = <W [ d 3 r c/>* r (r) 6(? - r)</) s (r) = 8 aa > fi(r) <j> a (r) 8 aa . (9.70) 



Both operators, i.e., (9.66) and (9.69), are spin-independent as is evident from the factor S aa /. 
An example for a two-particle operator is the Coulomb repulsion operator 



-I N 2 



2 ^ \Tj - r k \ 
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which is also spin-independent. In this case the generic operator is v(x, x 1 ) = q 2 /\r\ — 7^2 1 - The 
matrix elements of the generic operator in terms of single-particle states are 

(r, a; s, a'\ t \t, a"; u, a'") = 
J J d 3 r 2 #(ri)#(f 2 ) Jf^\ Mn)M?2) 5 aa/ ,5 a , a ,„ (9.72) 

As a final example of one- and two-particle operators we consider operators of total spin. The 
operator 

N 

S = Y,Sj (9-73) 
3=1 

is a one-particle operator. Here, Sj is the spin operator for particle j with components (in the 
spherical representation) Sj + , Sj — , Sj t3 . The generic operator is given through the Pauli matrices, 
i.e., s = \a. The non-vanishing matrix elements of the spherical components of s are (a = +|, (3 = 



J rs 



(r,a\s + \s,P) 

(r,P\s-\s,a) = 5 rs 

(r,a\s 3 \s,a) = +\5 TS 

(r,P\s 3 \s,(3) =-!<*„ (9.74) 



The operator 



/ N \ 2 N 

52 = E ^ = E ^ • ^ ( 9 - 75 ) 

\3=1 J j,k=l 

is a two-particle operator, however, not in the strict sense of our definition above since the restriction 
j / k does not apply in the summation. However, one can obviously extract the term ^ • S? to be 
left with a two-particle operator in the strict sense of our definition. The generic operator is 

S12 = Si ■ S 2 = -Si t+ S 2 - + -Si-S 2 ,+ + Si :3 S 2:3 (9.76) 
the single-particle state matrix elements of which are 

(r,a; s,a'\s 12 \t,a"; u,a"') = 8 rt 8 su (\8i_ (T 8_i (J ,,8_i (jl 8i (jlll + 
+ \8_ i a 8i a „8i a ,8_i a . lll + \ 8 t j C n8 c t ( ,iii ( 8 aa i — 8- aa i)\ . 



(9.77) 



The operators (9.74) and (9.77) are orbital-independent as evidenced by the factors 8 rs and 8 r t8 s 
in the respective formulas. 
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Definition in Terms of Creation and Annihilation Operators 

In order to express many-particle state matrix elements (9.62) through single-particle state matrix 
elements (9.63, 9.64) one replaces the operators F in (9.59) and G in (9.60) as follows 

ffe) - ( SHf'^h bOS ° nS (9-78) 
^ { 22r,s=i( r \f\ s ) c rCs fermions 

1 y ^ ^ ( i^ Att=1 <r,^|t,«>4aUa„ bosons 

2^ [ 2 Sr, s ,t,«=i( r > s l5|i,«)c|ck u Ci fermions 

It is crucial to notice in the expression for the fermion two-particle operator that the order of the 
annihilation operators in (9.79) is opposite to that in the matrix element (r, s\g\t,u), namely c u ct, 
and not ctc u . 

Expressions (9.78) and (9.79) have the following implication: The operators (9.78, 9.79) in the 
basis of many-particle states \ty B ' F (J\f)) have the same values as the respective matrix elements 
(9.62). In order to determine these matrix elements one needs to evaluate first the matrix elements 
of the generic operators {r\f\s) (r,s\g\t,u) as defined in (9.63) and (9.64), respectively, and then in 
a second step the matrix elements 

(^ B (AOka s |^ B (A0) , (y B (Ar')\4.ala t a u \y B (Ar)} bosons (9.80) 

(^ F (AO|c]x s |^ F (A0) , (y F (Ar')\ctclc t c u \y F (Ar)} fermions (9.81) 

For the latter matrix elements simple rules can be derived from the algebraic properties of the 
boson and fermion operators (9.23, 9.24) and (9.45, 9.46), respectively. These rules will be provided 
below only for the case of fermions. To show that the resulting values of the matrix elements (9.62) 
are correct one needs to compare the result derived with conventional derivations of the matrix 
elements 2 . 

The Matrix Elements {^> F (M')\cU s \^ F (M)) 

We consider first the case r = s. The matrix elements are then actually those of the number 
operator c\c r which is diagonal in the occupation number representation, i.e., 

S 

<* F (AOl4.c r |* F (A0> = n r H 5 n , nr . (9.82) 



r=l 



In case of r ^ s one obtains 



(* F (AO|c]: Cs |* F (A0> = (9-83) 



<c r * F (AO|c s * F (A0> = n' r .(-ir'rn s (-l)^5 n/r . lnr 8 <ns . l f\ t=1 8 n , 

t+r,s 



2 We refer the reader to Condon and Shortley's 'Theory of Atomic Spectra', pp.171 and pp.176 where the matrix 
elements of fermion states can be found. 
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From (9.82, 9.84) we can conclude that for diagonal elements holds 

s 

(^(AOIFI^AO) = J2n r (r\f\r) (9.84) 

r=l 

The off-diagonal elements with n' r = n r except for r = s,t, in which case n' s = n s + 1 and 
n' t = nt — 1 holds, are 

<* F (AO|F|* F (A0> = (l-n s )(-iy°n t (-iy*(s\f\t) (9.85) 

All other matrix elements vanish, i.e., those for which N' and N differ in more than two occupation 
numbers. Comparision with the results in Condon&Shortley (pp. 171) shows that the operator 
defined in (9.78) does yield the same results as the operator defined in (9.59) when the matrix 
elements are determined between Slater determinant wave functions. 



The Matrix Elements (tf F \M')\cUic t c u \^ ¥ \M)) 

Before we determine these matrix elements we will investigate which possible combination of indices 
r,s,t,u need to be considered. Starting from the definition (9.79) and using (cf^ = 0, c 2 r = 
we can write 

1 S 

^ = 2 X {rs\g\tu)44c u ct (9.86) 

r,s,t,u 

For the combination of indices r,s,t,u in this sum three possibilities exist 
Case two of the four indices are different; 
Case 1 three of the four indices are different; 
Case 2 all four indices are different. 

Accordingly, we split the sum in (9.86) into three contributions, namely, G = Go + G\ + G2, 
where each contribution corresponds to one of the three cases mentioned. 
The first contribution Go is 

1 S 1 5 

G = - X {r s\g\r s)clc\c s c r + - ^ (r s\g\sr)clc\c r c s . (9.87) 

r,s=l r,s=l 

a.i.d. a.i.d. 

Here and in the following we denote by 'a.i.d.' (all indices different) that only tuples 
(r,s), (r,s,t), (r,s,t,u) are included in the summation for which all indices are different, i.e., for 
which holds r 7^ s, r 7^ t etc. The anti-commutation property (9.45) yields together with the 
definition of the occupation number operator (9.53) 

^° = \ E (( rs \9\ rs ) ~ (rs\g\sr))N r N s . (9.88) 

a.i.d. 
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Obviously, this contribution to the two-particle operator is diagonal in the occupation number 
representation. As we will find, this part accounts for all diagonal contributions to G. 
The second contribution G\ is 



A -, Q ft 

G\ = 2^2r,s,t=i(r s\g\rt)& r c l s ctc r 

a.i.d. 

1 S i i 

+ 2Er,s,t=i (r s\g\ts)c\.d s c s ct 

a.i.d. 

1 S i i 

+ 2^2r,s,t=i (r s\g\st)cWsC t c s 

a.i.d. 

+ \ Er,s,t=i (rs\g\tr}cUtc r c t . (9.89) 

a.i.d. 

Commutation of qc u according to (9.45), employing the symmetry property (9.65), relabelling 
summation indices and using c\c\ctc r = c\c\c r ct = c\ctN r yields 



Gi = Yl (i rs \9\ rt ) ~ {rs\g\tr)}clc t N r . 

r,s,t=l ^ ' 



(9.90) 



Since ^ r is diagonal, this contribution obviously behaves similarly to the off-diagonal part (9.84) 
of the one-particle operator F. 

A similar series of transformations allows one to express G2 which can be written 

G 2 = 2E r,s,t,u (r s\g\tu)cWsC u c t 

r>s,t>u 

+ 2 52 S r,s,t,u (r s\g\tu)clc\c u c t 

r>s,t<u 

+ \ T, S r,s,t,u (rs\g\tu)cicU u c t 

r<s,t>u 

+ \ T, S r,s,t,u (r s\g\tu)clc\c u c t (9.91) 



2 

r<s,t<u 



as follows 



G 2 = ( ( rs \9\ tu ) - ( r s\g\ut) j c\c\c u c t . 

V G f. II. ' 



(9.92) 



r,s,t,u 
r>s,t>u 



This contribution to G has non-vanishing matrix elements only when N' differs from N in four 
occupation numbers n r ,n s ,n t ,n u . A similar contribution does not arise for F. 
We can state now the matrix elements (M')\G\^ F (M)} . For this purpose we consider three 
cases which actually correspond to the three cases considered below Eq. (9.86). 

Case This case covers diagonal matrix elements, i.e., M' = N or n' r = n r for all r. Only Go, 
given in (9.88), contributes in this case. One obtains 



<* F (A0|G|* F (A0> = \ J2n r n s ((r, s\g\r, s) - (r, s\g\s, r) \ . 

r,s ^ ' 



(9.93) 
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Case 1 This case covers off-diagonal elements with n' r = n r except for r = s, t for which holds 
n' s = n s + 1 and n' t = nt — 1. Only Gi, given in (9.90), contributes in this case. One obtains 

(^ F (AA')|G|^ F (AA)) = 
(-1)^(1 -n,)nt £ J n r ( (r,s\g\r,t) - (r,s\g\t,r)) (9.94) 

Case 2 This case covers off-diagonal elements with n' r = n r except for r = s,t,u,v for which holds 
n' s = n s + 1, n' t = nt + l,n' u = n u — 1, n' v = n v — 1. Only G2, given in (9.92), contributes 
in this case. One obtains 

(^ F (AA')|G|^ F (A0> = (9.95) 
± (_!)<£ +<£+««+<?„ (1 _ ns )(i _ n r )n u n v ^(r,s\g\u,v) - (r,s\g\v,u)^j 

In the latter formula the '+'-sign applies for s < r, u < v or s > r, u > v , the '—'-sign applies 
for s < r, w > w or s > r, u < v. 

All matrix elements which are not covered by the three cases above vanish. In particular, for 
non-vanishing matrix elements (J\f')\G\^ F (J\f)) at most four occupation numbers n' r and n r can 
differ. Furthermore, particle number is conserved, i.e., it holds Yl r =i n 'r = J2r=i n r- 



Exercise 9.3.1: Derive (9.90) from (9.89). 
Exercise 9.3.2: Derive (9.92) from (9.91). 
Exercise 9.3.3: Derive (9.95) from (9.92). 

Exercise 9.3.4: Show that the matrix elements of F and of G conserve particle number. 
Exercise 9.3.5: Derive the non-vanishing matrix elements (9.62) for bosons. 



Spin Operators 

In this section we will briefly consider the spin operators S given in (9.73) and S 2 given in (9.75) 
which are, respectively, one-particle and two-particle operators. The matrix elements of the cor- 
responding generic operators are (9.74) and (9.76). We like to express these operators through 

fermion creation and annihilation operators. One obtains for the three components of S 

S 



S3 = \ yi - ^) 

r=l 

S 

S + = ^2 c ta c r/3 
r=l 
S 



r=l 
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For S 2 one obtains 

s s 
S 2 = \n + - (Nra - N rP )(N sa - N sP ) - c \J s pCrpc sa . (9.97) 

r,s=l r,s=l 

The summation in (9.96, 9.97) is over the orbital states, the spin part of the single-particle states 
is represented by a = \ and (3 = — \. 

Exercise 9.3.6: Derive (9.96) and (9.97). 



9.4 Independent-Particle Models 

In the remaining part of this chapter we will apply the technique of operators of 2nd quantization 
to the description of many-fermion systems as they arise, e.g., in crystals, molecules, atoms and 
nuclei. In all these systems the Hamiltonian is a sum of one- and two-particle operators 

S s 
H = ^2( r \t\ s ) c l c s + 2 { r i s \v\u,v)clc\c u ct 

r=l r,s,t,u 

In many cases the Hamiltonian is spin-independent and the equivalent notation 

s I s 
H = ^2 (r\i\s)4 a c S(7 + - ^ ( r > s N^> c L c L' c ^' c ^ 

r,s=l r,s,t,u=l 
<r <r,a' 

will be used. In the latter case the indices r, s, t, u refer only to the orbital part of the single-particle 
basis. 

If it were not for the two-particle contribution to the Hamiltonian, which represents the interactions 
between particles, the description of many-particle systems, e.g., evaluation of their stationary 
states and excitation energies, would be a simple exercise in linear algebra. Unfortunately, the 
two-particle operator makes the description of many-particle systems a very hard problem. The 
fortunate side of this is, however, that the two-particle operator representing interactions between 
particles induces a variety of interesting phenomena. 

Actually, the study of problems governed by Hamiltonians of the type (9.98, 9.99) has preoccupied 
an important part of all intellectual efforts in Theoretical Physics during the past fifty years. In 
fact, it is one of the main intellectual achievements of Physics among the Sciences to have addressed 
systematically the study of systems composed of many strongly interacting components. The out- 
come of these studies is that even when interactions between components (particles) are simple, 
the concerted behaviour of interacting systems can deviate qualitatively from that of systems made 
up of independent components. Examples is the laser action, superconductivity, but also ordinary 
phenomena like freezing and evaporation. Often, the systems investigated involve a macroscopic 
number of components such that statistical mechanical approaches are envoked. The great general- 
ity of the concepts developed in the context of many-particle systems can be judged from the fact 



(9.98) 



(9.99) 
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that today these concepts are providing insight into the functioning of biological brains, another 
prototypical systems of many strongly interacting components, namely of neurons 3 . 
Before continuing one may finally point out that the material world as we know it and as it 
establishes the varieties of natural substances, ranging from nuclei to the molecules of living systems 
, ultimately depend in a crucial way on the fermion character of its constituent building blocks, 
electrons and nucleons. If it were'nt for the fermion nature, all material systems would condensate 
into states which would depend little on particle number. The Aufbau principle, according to which 
nuclei and atoms change their qualitative properties when they grow larger, would not exist. The 
electronic properties of atoms with different numbers of electrons would differ little, Chemistry 
essentially would know only one element and Life would not exist. 



Exercise 9.4.1: Imagine that in a closed, water-filled jar all electrons of water turn their fermion 
nature into a boson nature. What might happen? 



Independent-Particle Hamiltonian 

A many-fermion system governed by an independent-particle Hamiltonian, i.e., a Hamiltonian 
without a two-particle operator contribution accounting for interactions among the particles, can 
be described in a rather straightforward way. We will restrict our description in the following to 
systems composed of an even number ( 2N ) of particles and to spin-independent interactions. The 
Hamiltonian of such system is 

s 

Ho = Yl (r\i\s)4 a c sa (9.100) 

r,s=l 
a 

We will denote the matrix elements of i as (r|f|s) = t rs . 



Transformation to a New Set of Creation and Annihilation Operators 

Our aim is to represent the Hamiltonian (9.100) in a form in which the factors c\ a c S u become 
occupation number operators. In such a representation H Q is diagonal and the stationary states 
can be stated readily. To alter the representation of H D we apply a unitary transformation U to 
the single-particle states |r) to obtain a new basis of states \m), i.e., {\m),m = 1,2,... S}, where 

fi- 
lm) = Y,U rm \r) . (9.101) 

r=l 

The unitary property of U rn 

UtU = UUt = 1 (9.102) 

or 

S S 

^ y U * rm U rn = 5 mn , ^ ^ U rm U* sm = 5 rs (9.103) 

r=l m=l 

3 An account of some of these efforts can be found in Modelling Brain Function - The World of Attractor Neural 
Networks by D.J. Amit (Cambridge University Press, New York, 1989) 
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allows one to express conversely |r) in terms of \m) 



One can, hence, expand 



s 

\r) = Y, U rm\m) . (9.104) 

m=l 



^2t mn U rm U; n (9.105) 



where 

S 

tmn = (m\t\n) = V t rs U* rm U sn = (uHu) (9.106) 

r,s=l 

which together with (9.100) yields 

Ho = t mn d jna .d n(7 (9.107) 

where 

S s 

d\i a = ^ ^ Urn^rg i d nu = ^ ^ U rn Cra (9.108) 
r=l r=l 

These operators describe the creation and annihilation operators of fermions in states \n) which 

are linear combinations (9.101) of states \r). The unitary property (9.103) allows one to express 
[c.f. (9.104)] 

s s 

4 a = J2 U *mdL, c sa = J2u sn d na . (9.109) 

m=l n=l 

The operators (9.108) obey the same anti-commutation properties (9.45, 9.46) as c\(j and c r(7 , 
namely, 

ld ma ,d na ,} + = 0; [dl a ,dlj + = (9.110) 

[dma,dl a/ } + = 5 mn 5 aa i (9.111) 
and, accordingly, they are Fermion creation and annihilation operators. In a basis of states 



N 

IK-, i°) ( 9 - n2 ) 

3=1 

, i.e., of iV-Fermion states in which single particle states \rij) as defined in (9.101) are occupied, the 
operators dn a and d na , behave exactly like the operators do- and c ra behave for states \^ F (Af)); for 
example, the expressions drived above for the matrix elements of one- and two-particle operators 
hold in an analogous way for dn a and d n<7 . 

We demonstrate here the anti-commutation property (9.111), the anti-commutation properties 
(9.110) can be demonstrated similarly. The l.h.s. of (9.111) can be written, using (9.108) and 

[dma,dl a ,} + = Y, U * m Usn[Cra,cU + = <W £ U* r JJ rn . (9.113) 
r,s r 

The unitarity property ^ r U* m U rn = 5 mn yields immediately identity (??). 
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Diagonal Representation 

The transformation (9.101) gives us the freedom to choose the matrix (9.106) diagonal, i.e., 

tmn = £n$mn ■ (9.114) 

In this case H Q has the desired form 

S 

H = Ys^ndLdne (9.115) 

n=l 

and involves solely occupation number operators N na = dn a d nr7 . It is, hence, a simple matter to 
state many-particle states in the new representation. 

Before proceeding we like to address the issue how the new representation, i.e., transformation 
matrices U rn and energy values e„, is obtained. The condition (9.114) together with (9.106) is 
equivalent to 

s 

^trgUsn = e n U rn W,r = 1,2, ...S (9.116) 

s=l 

which follows from (9.106) and the unitary property of U rn . This equation poses the eigenvalue 
problem for the hermitian matrix (t rs ). The corresponding eigenvalues e n ,n = 1,2,. ..S are 
real, the associated ( properly normalized) eigenvectors V n are the columns of (U rn ), i.e., = 

(Ui n , U~2n, ■ ■ ■ Usn)- 

Eigenstates of (9.115) can be stated immediately since any many-particle wave function in the 
occupation number representation is suitable. We apply for this purpose (9.57) to the case of 2N 
particles. The 2N fermion state 

2N 

H 4. c . |0> where (n^aj) + (n k , a k ) for j + k (9.117) 
i=i 

are eigenstates of (9.115) with eigenvalues 

2N 

E(m,ai; n 2 ,a 2 ; ...n 2 N,^2N;) = ■ (9.118) 

Ground State 

In case of an ordering e m < e n for m < n the state of lowest energy, the so-called ground state, is 

N 

|*o> = II d] a d]p\0) . (9.119) 

3=1 

In this state the N lowest orbital eigenstates of t rs are occupied each by an electron with spin 
|a) = 5) and \(3) = ||, — 5)- A non-degenerate orbital state which is occupied by two fermions, 
i.e., a fermion in a spin state ||, \) as well as a fermion in a spin state ||, — |) is called a closed 
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shell. The ground state has only closed shells. We will demonstrate now that this property endows 
the ground state with total spin zero. 

In order to investigate the total spin of |3> ) we apply to this state the total spin operator S 2 as 
given by (9.97). The spin operator in the present representation of single-particle states is 

s s 
S 2 = \n + - ft™ ~N m p){N na -N n p) - S m J nfi d mfi d nOL (9.120) 

m,n=l m,n=l 

where the occupation number operators refer to the single-particle states \m). The action of the 
operator (9.120) is particularly simple for closed shells. One can conclude immediately that the 

second term in (9.97), namely, \ ^ n=1 (iV ma — N m p)(N na — N n p) does not give any contribution 

if either |m) or |n) are closed shells. Similarly, the third term — n=1 Cm a c nl3 c m fsc na does not 
give a contribution if m ^ n and either \m) or |n) are closed shells. In case of m = n it gives a 
contribution '1' for each closed shell which cancels the contribution of the first term. Alltogether, 
one can conclude that |3> ) is an eigenstate of S 2 with eigenvalue zero, i.e., is a singlet state. 



Excited States 

We want to construct now excited states for the system governed by the independent-particle 
Hamiltonian (9.100). Obviously, the states with energy closest to the ground state are those in 
which a particle is promoted from the highest occupied state \N) to the lowest unoccupied state 
\N + 1). There are four such states, namely, 

| a, a)) = d ] N+la d Na \^ ) 

K/3» = d ] N+la d N p\$ ) 

a)) = d^ N+lj3 d Na \^ ) 

10,13)) = S N+1/3 d Nf3 \^ ) (9.121) 

All four states have the same excitation energy, i.e., energy above the ground state, of AE = 
6jv+i — £tv- However, the states differ in their spin character. The two states \a,(3)) and \/3, a)) 
are eigenstates of S 2 given in (9.120), both with eigenvalues 2 

£>,/?» = 2\a,P)); S 2 \\^a)) = 2|/3,a». (9.122) 

This property can be derived as follows: The two states are obviously diagonal with respect to the 
following contribution to S 2 

j ^ N-l N-l 

Si = 2^ + 4 t^rna ~ N mP ){N na - N n p) - 4nadlpd m pd na (9.123) 

m,n=l m,n=l 

which acts only on the N — 1 closed shells (except for TV) of the two states and has an eigenvalue 
N + — (N — 1) = 1 in both cases. The remaining contributions to S 2 which act on partially 
occupied orbitals \N) and \N + 1) are 

-. N+l 

t2 = 4 E - ti m p){N n * ~ N n(3 ) (9.124) 

m,n=N 
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and 

N+l 

±3 = ~ Yl dtnadlpdrnpdna • (9.125) 
m,n=N 

Contributions to S 2 acting on states \N + 2), \N + 3), . . . do not need to be considered since they 
give vanishing contributions. The two states \a,(3)), \/3,a)) are, of course, also eigenstates of £2, 
both with eigenvalues 1. The action of S3 on the two states produces , for example, for \a, (5)) 

N+l 

t 3 \a,/3)) =- ^ dlnadlpdrnpdnady^dNplQo) = (9.126) 
m,n=N 

which follows from the occurence of squares of fermion operators which, of course, vanish. The 
same result holds for \(3,a)}. One can, hence, conclude that (9.122) is correct. 
An eigenvalue '2' of the spin operator S 2 identifies the states \a,(3)) and \(3, a)) as triplet states. 
The remaining states \a, a)) and \{3, j3)) are not eigenstates of S 2 , however, the linear combinations 
^/l/2(|a, a}} ± \/3,P}} qualify as eigenstates, one with eigenvalue zero (singlet) and one with 
eigenvalue '2' (triplet). 

Exercise 9.4.2: Construct four operators 0( m 

0,0' 

such that Oe m \& ) are triplet and singlet states, i.e., apropriate eigenstates of the operators S 2 and 
S 3 - 



Example: 2N Independent Electrons on a Ring 

In order to illustrate the procedure to obtain eigenstates of one-particle Hamiltonians (9.100) 
outlined above we consider a system of 2N electrons which move in a set of atomic orbitals 
|r), r = 1,2, ...2JV which are located on a ring. The system is assumed to posses a 2iV-fold 
symmetry axis and interactions connect only orbital states |r) — > \r ± 1). We identify the states 
\2N + 1) = 1 1) to avoid cumbersome summation limits, etc. The system is described by the 
Hamiltonian 

2N 

Ho = ~tJ2 ( C l l+ laCna + 4^+1.) ■ (9.128) 

n=l 
a 

The cyclic property of the system is expressed through 

C2N+la = Cl a ; cjtf+ltr = C L (9.129) 

The symmetry of the Hamiltonian suggests to choose a new representation defined through the 
operators (r = 1,2,... 2N) 

2N 

V2N ^ 

v n=l 
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2N 

dl = J2 eir ™ /Nc n<r ( 9 - 131 ) 

* n=l 

(9.130, 9.131) constitute a unitary transformation U denned through U rn = ^^ e%rniT ^ N ■ The 
unitarity property follows from 

2N 

^ e irS7r/N = for < s < 2N . , (9.132) 

r=l 

an identity which can be derived employing the well-known expression for a finite geometric series 

jr a s = a^^— . (9.133) 

s=l 

Application to the l.h.s. of (9.132) gives for r / 0, 2N, AN,... 

i _ 2Nisn/N 

e isn/N . (9.134) 

I _ e isn/N y * 

which, indeed, vanishes for integer s. One can then express[i\ (9.109)] 

2N 



1 J^ e irn,/N dra (9135) 



f 2N 

r=l 



2N 

J = 



V2N 

v r=l 

and, thereby, one obtains the new representation of H Q 

2N 



r;,„ - l^e-*™'"^ (9.136) 



#o = -t l rsdt c d sc (9.137) 



r,s=l 



where 

2AT , 2iV 



^ _ c -irn/N \ ^ e »(s-r)n7r/Af _j_ ^sir/N \ e i(r-s)nn/N _ (9.138) 

n=l n=l 

Using (9.132) one can conclude 

t rs = (V™/^ + g -W^ s rs (9.139) 



from which follows 

2N 



60 = E ( cos ^ ) • (9-140) 



r=l 

tr 

The Hamiltonian is now diagonal and the construction outlined above can be applied. 
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Exercise 9.4.3: Construct the ground state and the lowest energy excited state(s) for a sys- 
tem of 2N electrons moving in a linear chain of single-electron orbitals |n), n = 1,2,... 2N with 
interactions between neighbouring (n, n ± 1) orbitals and Hamiltonian 

2N-1 

Ho = ~t ( c l+l* c nv + 4a c n+l<x) • (9.141) 

n=l 

IT 

State the excitation energy AE. 



9.5 Self-Consistent Field Theory 

Observations of many-fermion systems show that properties of such systems often can be explained 
to a surprisingly good degree on the basis of the assumption that the fermions move like independent 
particles. The most famous example is the periodic table of the elements. The regularities of the 
elements can be rationalized in terms of electrons moving in hydrogen atom-type orbitals, i.e., 
in states which are constructed according to the 'Aufbauprinzip' in a manner which neglects the 
mutual repulsion between electrons. A similar principle accounts approximately for properties of 
nuclei as described by the nuclear shell model. The properties of molecules can be understood 
largely in terms of models which place electrons into so-called molecular orbitals in which the 
electrons move as if they were not interacting with other electrons. Many electronic properties 
of solids can be understood in a similar way, i.e., assuming that electrons move independently 
from each other in so-called bands, e.g., valence bands or conduction bands. Of course, properties 
of many-fermion properties deviant from such simple description is then of as much interest as 
independent fermion descriptions are useful. 

Since independent-particle behaviour is so universal one may wonder why the presence of particle- 
particle interactions, i.e., the everpresent Coulomb repulsion between electrons, does not spoil it. 
The reason is mainly connected with the fact that most many-fermion systems are found in their 
ground state or removed from it through low energy excitations, and that independent-particle 
behaviour surfaces mainly because it applies well to the ground state. There are two reasons 
why this is so: first, if one places fermions into an independent-particle ground state of the type 
(9.119) then perturbations due to pair interactions according to the Pauli exclusion principle can 
only involve independent particle states not occupied in the ground state, i.e., those with energy 
above that of the (energetically) highest occupied single-particle state. Hence, the fermion nature 
restricts the possibility for perturbations on the system, in particular, if one can choose the single- 
particle states such that the residual perturbations are small. We will present in this and the 
following section a method which constructs such optimal single-fermion states. These states do 
not altogether neglect the pair interactions, rather they assume that each particle experiences the 
average interaction due to the fermions frozen into the ground state. 

A second reason why independent-particle models can be successful is that the pair interaction for 
electrons is the slowly decaying Coulomb interaction. The slow decay of the interaction makes the 
motion of any one electron rather independent of the exact position of most other electrons, i.e., 
one expects that mean-field descriptions should be rather sufficient. 
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In this section we will consider systems of 2N fermions described by the spin-independent Hamil- 
tonian 

s I s 
H = ^2 ( r \i\ s )4<T c so + 2 ^2 ( r ' s l*l*' u ) c L c L' c ^' c ^ (9.142) 

r,s=l r,s,t,u=l 
a a,<j' 

and derive a single-particle operator which approximates this Hamiltonian. 

In our formulas below we will adopt strictly the convention that indices r,s,t,u refer to the initial 
basis of single-particle states \r),r = 1, 2, . . . , 5 on which (9.142) is based. We will adopt a second 
set of single-particle states {|m), m = 1, 2, ... S} the elements of which will be labelled by indices 
m, n,p, q. The states \ fh) are connected with the states |r) through (m = 1, 2, . . . S) 



\m) = Y, U rm\r); \r) = £ U* rm \rh) . (9.143) 

r=l m=l 

These S states are associated with the creation and annihilation operators 

s 

draa = ^ ] U rm c \cr 
r=l 
S 

d-ma = ^ ^ Urm c ra (9.144) 
r=l 

defined as in (9.108). The states \rh) serve to define a reference state of the system 

N 

|$ G (U)) = n 4J jp \0) . (9.145) 

which will play a pivotal role: it is this the state in which the fermions are assumed to be moving 
and relative to which the mean interactions acting on individual fermions is determined. 
U = (U rm ) as defined in (9.143) with r,m = 1,2, ...S is a unitary matrix. If one restricts 
m = 1, 2, . . . N, as in in (9.145), the corresponding U = (U rm ) forms an 5 x N matrix and, 
naturally, is not unitary. 

Mean-Field Potential 

The mean field for the reference state |$ (U)) is defined in a straightforward way by averaging 
the two-particle contribution to (9.142) over this reference state to turn the contribution into an 
effective one-particle operator. This is done as follows: 



tW(|<MU)» = 

1 s ( 

2 £ (r,s\v\t,u)( {{cl c c\ al ))c u<J >Cta + 4a c la'((^' C t<y)) 
r,s,t,u=l 

<j,a' 

ru c tcr)) c ua' 

~ 4a(( C L> C tv)) C ua' ~ 4a'((4a C ua'))c t a ^ (9.146) 
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Here ( ( • • • ) ) denotes the average 

«0» = <$ (U)|0|$ (U)> . (9.147) 

Since the reference state |<& (U)) is defined in terms of the single-particle states \rh) it is preferable 
to switch the representation of (9.146) accordingly. Since the averages affect only two creation- 
annihilation operators actually a mixed representation is most suitable 

1 S 

V m f (|$ (U))) = - Yj (^^\^ u )((4na d na')) C ^ ,C t^ 



where, for example, 



m,n,t,u=l 



+ 



+ 



1 



r,s,m,n=l 
a,cr' 



1 S 

- ^ { r ^ fh W,n)cl a {{d) m(jl dna'))cta 



r,m,n,t=l 
cr,a' 



1 S 

- Yl ^^\h,u)c\ a ,{{d ma d n(T ))c v 



m,s,n,u=l 
<T,cr' 



1 

- Y {r,m\v\n,u)ci a {{d} mal dna))c U(jl 



r,m,n,u=l 
a,cr' 



1 S 

- Y (m,s\v\t,n)cl a ,((dl ia d n(7 ,}}cta 



m,s,t,n=l 
ct.ct' 



(r,m\v\t,n) = ^ (r, s\ v t, u) U* m U un . 



(9.148) 



(9.149) 



The remaining matrix elements appearing in (9.148) are obtained in a similar way by transforming 

only two of the four single-particle states into the new representation. 

For the averages (( • • • )) in (9.148) one obtains, by means of the rules (9.84), 



and, hence, 



((dmadna')) 
((dlnadna')) 



<$ (U)|4, a 4 CT ^o(U)> = 

{<S> (U)\d ma d na ,\<!> (U)) = 

($ (u)|dL^l^(u)) 

SmnSaa' m < N 

m > N 



1 5 

V m f (|$ (U))) = ~Y ^{ r ^\v\t,h}cl (T c tcr <W 



m<N r,t=l 
c.c' 



(9.150) 
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1 S 
+ - ^] y^^(rh,s\v\h, u)c\ a , 



m<N s,u=l 
<T,cr' 

S 

r / \ i . i i i i t ■■ i f / , . 1 1 . i i 

2 



m\v\n, u)c' ra c uu 



m<N r,u=l 
u 

1 5 

S^' 5 ^!*'™)^ ^ • (9.151) 

m<JV s,t=l 

Carrying out the sum Y^^ <5o- CT = 2 leads to a factor 2 for the first two terms. Exploiting the sym- 
metry (r, s\v\t,u) = {s, r\v\u, t) and renaming dummy summation indices one can demonstrate 
that term one and term two as well as term three and four are identical and one obtains 

s 

V mf (\&o(V))) = 2 (r,m\v\t,n)cl a c ta <W 

r,m,t=l 
cr,cr' 

s 

- ^ (r,m\v\h,u)ci a c ua > (9.152) 

r,m,u=l 

By means of expression (9.149) for the matrix elements in the mixed representation one can state 
the right hand side explicitly in terms of U rm . The one-particle operator (9.152) can then be written 
in the form 

V mf (|*o(U))) = ^ Vrnf (|*o(U))) \S) c\ a C S(J (9.153) 

r,s 
a 

where 

<r|t> m/ (|$ (U)» \s) = J2(2(r,t\v\s,u) - (r,t\v\u,s)) I £ U* m U um \ (9.154) 

t,u \m<N J 

The mean field approximation replaces then the Hamiltonian (9.142) by H m f(U) defined through 

S 

H mf (U) = Y, (<*> + <r|t) m/ (|* (U))) \s)) c\ c c sa (9.155) 

r,s=l 

which is a function of the S 1 x TV-matrix 



U = (U rm ) r=1 , 2 ,... S ;m=l,2,...N (9-156) 

which defines the reference state |<fr (U)). The mean field approximation, as introduced here, leaves 
open the question how the reference state should be chosen. At this point this state is completely 
arbitrary. We will address now a proper choice of the reference state. 
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9.6 Self-Consistent Field Algorithm 

The Self-Consistent Field (SCF) approximation, often also referred to as the Hartree-Fock approx- 
imation, is based on the mean field approach and provides an algorithm to construct a reference 
state |$ G (U)) for a 2N fermion system described by a spin-independent Hamiltonian (9.142). The 
procedure determines the reference state as the ground state of the independent-particle Hamilto- 
nian (9.155) following the method outlined in Section 9.4. This procedure, however, can achieve 
this goal only iteratively, assuming in an intial step (1) a properly chosen reference state, deter- 
mining in a step (2) the corresponding mean field Hamiltonian (9.155), and obtaining in a step (3) 
the ground state of this one-particle Hamiltonian according to the construction in Section 9.4 and 
defining this as the new reference state; steps (2) and (3) are being repeated M times until the 
procedure converges, i.e., the reference state resulting from step 2M + 1 (within numerical error) 
is equal to the reference state resulting from step 2M — 1. The state thus determined is referred to 
as the self-consistent independent-particle ground state, the independent-particle nature stemming 
from the fact that the functional form of the ground state, i.e., (9.119, 9.145), is exact only for an 
independent-particle Hamiltonian. We will argue below that the self-consistent field ground state, 
under conditions which often are realized, is the lowest energy independent-particle ground state 
one can construct. 

Let us state now the construction of the self-consistent field ground state in more detail. 



SCF- Algorithm, Step 1: Choosing an Initial Reference State 

One defines the Hamiltonian 

S 

= £ (r\i\ S }4,c sa (9.157) 

r,s=l 

(T 

and determines the associated diagonal representation defined through 

S 

£>|i»t/« = e« l/W , r = l,2,...S, m=l,2,...S. (9.158) 

s=l 

where the labels m are ordered to obey the condition 

e$ < for m < n . (9.159) 

One defines the reference state |$ (U^ 1 - ) )) using the definition (9.145), i.e. U^ 1 ) is the S x iV-matrix 
of the first N column vectors of l/im, r = 1, 2, . . . S , m = 1, 2, . . . S. 



SCF- Algorithm, Step 2M, M =1,2,...: Determine Mean Field Hamiltonian 

In this step the mean field Hamiltonian 

H ( Z ] = £ (<*> + (r\v mf (\M^ M) ))) \s)) clc sa 

r,s=l 
a 

s 

= E {r\h^\s)cl c c sc (9.160) 

r,s=l 
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is evaluated, i.e., the S x 5-matrix 

S / N \ 

(r\h { ^\s) = (r\i\s) + £ (2(r,t\v\s,u) - (r,t\v\u,s)) £ ltf°C$P (9.161) 

t,u=l \m=l / 

is calculated. 

SCF- Algorithm, Step 2M + 1, M = 1, 2, . . .: Diagonalize Mean Field Hamiltonian 

In this step the eigenvalue problem 

E<r|0*>« +1) = 4 M+1) « +1) , r = l,2,...S,m = l,2,...S (9.162) 

s=l 

is solved adopting an ordering of the labels m which obeys the condition 

4f +1) < 4 M+1) ^ m < n . (9.163) 

The result yields the reference state |<E> (U( M+1 ))) using the definition (9.145), i.e. Ij( M+1 ) is the 
S x iV-matrix of the first N column vectors of Urm +1 \ r = 1, 2, . . . S , m = 1, 2, . . . S. 

SCF- Algorithm: Continuation and Convergence Condition 

When step 2M+ 1 is completed, step 2M + 2 is carried out, etc. The procedure is continued until it 
is detected that the one-particle density differences | X^m=i U*m^ +1 ^ uim +1 ^ — Sm=i ^*i M '^™'l 
for r, s = 1, 2 ... S do not exceed a preset threshold, indicating that the state converged. 



Exercise 9.6.1: Let P rs be the one-particle operator with the generic operator p rs = \r)(s\, i.e., 

(t,a\p rs \u,a') = 6 tr S su S aa > . (9.164) 

(a) Determine the expectation value of P rs for the state as defined in (9.145). 

(b) How can the expectation values of the operator P rr be interpreted. 

(c) Show that any spin-independent one-particle operator 

s 

F = Y, ( r \f\ s )4oCs* (9.165) 

r,s=l 
a 

can be written 

S 

F = ( r \f\ s ) p rs (9.166) 

r,s=l 

(d) Define a similar two-particle operator P rs tu- 

(e) Consider the creation operators 

N 

4a = E F -^L (9-167) 

m=l 
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which are connected with d\ ncr through a unitary N x iV-matrix (V qm ). Note that this is not an 
S x S'-matrix! Show that the reference state defined through 

N 

1144/31°) ( 9 - 168 ) 

9=1 

has the same expectation values for P rs and P rs t u as the reference state (9.145). 

(f) Can one distinguish the states (9.145) and (9.168) through a physical observation? 

Exercise 9.6.2: Determine the SCF ground state and its energy expectation value for a system 

of two particles described through the Hamiltonian (S = 2) 

H = ^2[e(cl a c la + 4 CT c 2(J ) - t(c\ a c 2(T + 4 CT ci a )) 

+ 2V ( c\ a C la c\ p C ip + 4a C 2«4/3 C 2/3 ) + « 4a C l^ C L' C 2^' 

+ 4<r c 2a4 ( 7' c i^ ) • (9.169) 



9.7 Properties of the SCF Ground State 

We want to investigate now the properties of the SCF ground state. We begin by summarizing the 
result of the SCF algorithm. 

We denote the representation which results from the SCF algorithm after its convergence as follows: 

R = E U r m CF) W ■ (9- 17 °) 
r=l 

Here |r), r = 1, 2, ... S denotes the initial single-particle states and (jJr m CF ^ is the unitary S x S- 
transformation matrix obtained in (9.162,9.163). For this representation holds 

N 

(m\t\h) + ( 2{mm'\ v \hrh') — (mrh'\ v \m'h) ) = ejf^' 5 mn (9.171) 

ml 

where the convention (9.163), i.e., 

4 SCF) < ^n CF) for m < n (9.172) 
is obeyed. Equation (9.171) implies that for the mean field Hamiltonian holds 

Hmf(V^) = £ tfWdLAn., (9-173) 
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i.e., the mean-field Hamiltonian is diagonal in the SCF representation. Finally, the SCF ground 
state is 

N 

\\SCF) = dlJ m ^) . (9.174) 

m=l 

The first property we consider is the total spin character of \\SCF). Since this state is composed 
only of closed shells one can conclude following Section 9.4 that \\SCF) is a total singlet state. 
We like to determine next the energy expectation value of \\SCF). Within the mean field approxi- 
mation as described by (9.173) holds 

N 

(SCF\\H mf (Xj( SCF 1)\\SCF) = 2 ^ CF) ■ (9-175) 

However, one can also determine the expectation value of \\SCF) for the Hamiltonian (9.142), i.e., 
(SCF\\H\\SCF). Employing expressions (9.84) and (9.93) for the expectation values (diagonal 
elements) of the one-particle and two-particle part of the Hamiltonian (9.142), exploiting the spin- 
independence of (9.142), one obtains 

(SCF\\H\\SCF) = (9.176) 

N N 
2 Y^(m\i\m) + ( 2(mm' \ v \ mm) — (mm'\v\m'm)} . 

m=l m,m'=l 

Comparision with (9.171) yields 

(SCF\\H\\SCF) = (9.177) 

N N 
2 ^ f {f Cf ' — (2(mm\v\mm) — (mm'\v\mm)) . 

m=l m,m'=l 

The second term originates from the fact that in the mean field approximation one assumes that each 
of the fermions is subject to an average pair interaction involving a 2A^-particle reference state. 
Since the system under consideration has only 2iV particles altogether, the mean field potential 
^WG^o(U))) defined in (9.146) counts a particle twice, once as a member of the ground state, 
and once as a probe particle being subject to the mean pair interaction. This over-counting leads 
to the correction term in (9.177). 



Exercise 9.7.1: Reformulate (9.171) in terms of matrix elements (r\i\s), (r,s\v\t,u) and Ur^ F \ 
Show that the resulting eigenvalue problem of the type (9.162) is non-linear in Urm 
Exercise 9.7.2: Derive (9.177). 
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9.8 Mean Field Theory for Macroscopic Systems 

The SCF algorithm has been developed in Sections 9.4-9.7 for finite systems. In the present Section 
we want to adapt the algorithm to systems containing a macroscopically large number of fermions. 
We consider as an example the Hubbard model of an infinite linear lattice and apply the model to 
describe magnetic instabilities in metals. 



The Hubbard Model 

The Hubbard model serves today the important role as the simplest manifestation of strongly 
correlated electron systems which arise in many instances in molecular and solid state physics, for 
example, in the two-dimensional copper oxide lattices of high temperature superconductors. The 
one-dimensional Hubbard model is described by the Hamiltonian 

H = ~t (4+1,ct C ™ + cLc r +l,a) + | ^ C lcr C la' C ro' Cra ■ (9.178) 

rcr r 

Here the index r, r £ Z, describes the sites of a linear lattice. We assume that there are altogether 
S = 2Nq lattice sites, labeled r = —Nq + 1, . . . , Nq which are populated by 2N electrons (we choose 
an even number of electrons for convenience). Both iVo and N are macroscopically large numbers. 
The operator c rc r (c rcr ) creates (destroys) an electron with spin a (a = a, (3 for 'up' and 'down' 
spins, respectively) in state |r) at lattice site r. t describes the coupling of state \r) to states 
\r ± 1) at the two neighboring lattice sites; t is assumed to be a positive, real number. According to 
(9.178) v contributes only in case that two electrons occupy the same lattice site, i.e., v represents 
the 'on-site' Coulomb repulsion. 

The first term on the r.h.s. of (9.178), is identical to that of the independent-particle Hamiltonian 
H (9.128), i.e., 

H Q = -t^2 (4+1,0-°™ + c\ a c r+ i^ . (9.179) 

rcr 

The potential energy term, i.e., the second term on ther.h.s. of (9.178), in the usual two-particle 
operator form, reads 

- ^2 (r,s\i>\t,u) cl a cl a ,c ua >c t a ■ (9.180) 



V 



2 

r,s,t,u 



where 



(r,s\v\u,t) =vS ru S st S rs . (9.181) 

The Hubbard model, as stated through Hamiltonian (9.178), is characterized through the parame- 
ters t, v of the Hubbard Hamiltonian (9.178) and through the so-called filling factor n 

2N N 

»=-s=«s- <« 82 > 

n can assume values < n < 2. The case n = 1 is termed the half filled band case, referring to the 
fact that in this case the band of single particle energies (9.192) derived below is filled half. 
In spite of the simplicity of its Hamiltonian the Hubbard model, except in case of one-dimensional 
systems, cannot be solved exactly. Due to the lack of an exact solution in dimensions higher than 
one, approximation schemes like the self-consistent field approximation play an important role. 
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Since we are dealing presently with a macroscopic system, boundary effects are assumed to be 
negligible, a freedom which we employ to adopt so-called periodic boundary conditions like those 
for the example "2iV independent electrons on a ring" on page 262. Accordingly, we identify 
\r + Nq) = \r) and later let iVo go to infinity. 



SCF Ground State for the Hubbard Model 

We seek to determine the ground state of the Hubbard model stated by (9.178) within the framework 
of the self-consistent field theory. For this purpose we apply the SCF theory presented in Section 9.6. 
Accordingly, we assume that the SCF ground state is given by (9.174), i.e., by 

N-l 

\\SCF)= l[dlJ mf} \0), (9.183) 

m=0 

where the operators d} m(J are related to the original creation operators c\ n through the unitary 
transformation U defined in (9.144). The self-consistent field Hamiltonian H m f which corresponds 
to the Hubbard Hamiltonian (9.178) can be determined by applying (9.155, 9.154). Using (9.181, 
9.179), one obtains for the mean field Hamiltonian 

H mf (V) = H + U rmU rm \ 4*Cra ■ (9.184) 

r,a \m=0 / 

We are now ready to apply step 1 of the SCF algorithm and diagonalize term Hq. This task has 
been solved already on page 262. The unitary transformation which diagonalizes Hq, in fact, is 

U rm = -7= exp (irmir/No) . (9.185) 
V2iVo 

According to (9.140), the operator Hq can be re- written in the diagonal form 

N 

H = Yl trnd\ na d m a (9.186) 
m=-N () + l 

where 

TTITT 

e m = -2icos— . (9.187) 
iVo 

The energy levels are labeled such that eo < e±i < . . . < e ± ^ _ 1 ) < ejv holds as can be readily 
verified. 

We can now embark on step 2 of the SCF algorithm. Inserting (9.185) into (9.184) and using 
(9.186) one obtains for the mean field Hamiltonian 

h < S } = E e »iA.+^E c ^' ( 9 - 188 ) 

m= _AT +l ra 

a 
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Expressing c\ a , c ra through d\n a , d ncr according to (9.109) and using (9.185) one can prove 

N 

J24 a c r(7 = Yl d Ldm*. (9.189) 

rcr m=-No+l 
<r 



N 



and, hence, 

H m f = H%)= £ {e m + V -^)dl a d m(T (9.190) 

m=-N +l 
a 

We have used in the latter expression the definition (9.182) of the filling factor n [c.f. (9.182)]. 
The Hamiltonian H m f, as given in (9.190), is already diagonal, i.e., the self-consistency condition 
is exactly met and the SCF algorithm converged after two steps. In other words, the ground state 
of the Hubbard model in the self-consistent field approximation can be determined exactly. The 
energy E m f of this state can be calculated readily by using (9.177) and 

( mm | v | mm) = (mm'\v\m'm) 



2N 

which holds in the present case. E m f is then 

N-l N-l 

Emf = 2 £ (e m + —) - N— = 2 (em + T ) (9.191) 

m=0 m=0 

where e m is given in (9.187). Apparently, the electrons in the present description behave like 
independent particles with energies e m = e m + ^ . 

So far we have not exploited the fact that No and N are macroscopically large quantities. When 
iVo becomes macroscopically large the single particle discrete energy levels (9.187) form a quasi- 
continuous energy band. Indeed, for k m = mix /No holds k m £ j— ir + , — tt + ^ . . . , 7r| and, in 
the limit iVo —> oo, holds k = k m g] — tt, tt] . The energy e m can be expressed as a function of a 
continuous variable k g] — tt, tt], namely, through the so-called dispersion law 

e(k) = -2tcosk , - it < k < it . (9.192) 

This dispersion law is presented in Fig. 9.1. One can see that —2t < e{k) < 2t holds. The bottom 
of the energy band corresponds to e(0) = — 2t and the width of the energy band is At. The ground 
state of the system, i.e., the state with the lowest possible energy, can be obtained, according to the 
Pauli principle, by filling up this energy band with electrons from the bottom of the band (which 
corresponds to k = 0) to a maximum energy, called the Fermi energy, denoted by €f (see Fig. 9.1). 



Alternative Description of the Mean Field Approximation 

The state (9.183) is not the only candidate for the mean field ground state of the Hubbard model. 
To demonstrate this we consider an alternative formulation of the mean field approximation for 
Hamiltonian (9.178). Indeed, there exist several ways of formulating mean field approximations, 
even for one and the same Hamiltonian. Usually, the results of different formulations agree quali- 
tatively, but may differ quantitatively. 



9.8: Macroscopic Systems 



275 



A e(k) = -2tcos(k) 




Figure 9.1: The dispersion law e{k) for independent electrons on an infinite one dimensional lattice. 
€f = e(±kp) denotes the Fermi energy. 



First, let us express the interaction term in (9.178) in terms of the occupation number opera- 
tors N ra = ctaC ra . The anti-commutation properties (9.45, 9.46) of the fermionic creation and 
annihilation operators yield 

c ra C ra' Cr,J ' Crcr = ~ < ^ra < ^ra' C ' rijCra '' = 

ct a c ra cl a ,c rr7 > - 4 CT c rCT <W = N ra N ra > - N^5 a(jl , (9.193) 

where, in the last term on the right hand side, we have employed (9.56). Inserting (9.193) into 
(9.178) and performing the summation over the spin indices results in 

H = H + v^N ra N r p . (9.194) 

r 

For a given reference state, say the one given in (9.183), the operator N ra can be written 

Nra = (Nra) + 6 (JV W ) , (9.195) 

where (N ra ) is the mean occupation number of the one particle state \ra), i.e., 

(N ra ) = (SCF\\N ra \\SCF) , (9.196) 

and where 5 (N ra ) = N ra — (N ra ) describes the fluctuations of N rcr , i.e., the deviation of N rcr from its 
mean value {N ra ) for the corresponding reference state. The mean field approximation neglects the 
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fluctuations to second order and the mean field Hamiltonian is obtained from (9.194) by dropping 
all the terms which contain the fluctuations to second order. In this approximation one can express 

N ra N rP = ((N ra ) + S (N ra )) ((N rf3 ) + 6 (N r(S )) 

« {N ra ){N r p) + {N ra )S (N rP ) + {N r/3 )S (N ra ) 

= (N ra )N rf3 + (N r/3 )N ra -(N ra )(N rP ) (9.197) 

and the corresponding mean field Hamiltonian becomes 

H rnf = H + vY^ {{N ra )N rf3 + (N rf) )N ra ) - v ^2{N ra ){N r p) . (9.198) 

r r 

Let us assume that the ground state of the Hubbard model, in the mean field approximation, is 
given by (9.183). Because of 

{N ra ) = {c\. a c ra ) = y U rm U rm i {$ ma d m < a ) 

m,m' 

N ~ 1 AT 

- ^ u rm u rm - 

m=0 u 

holds 

H mf = H + — ^ 4. a c ra - 2N — 

ra ~ 

= E ( e ™ + y) d ™ dma ~ N Y ' (9 - 199) 

ma 

which is identical to expression (9.191) for the ground state energy. Note that the procedure em- 
ployed in (9.195), i.e., separating the occupation number operator into a mean value plus fluctuation 
and neglecting the fluctuations in 2nd order, yields essentially the same mean field Hamiltonian 
(9.198) as the one obtained by using the mean field potential (9.146). 



Spin-Polarized Mean Field Ground State 

We want to consider now a ground state for H m f defined through 

(N ra ) =n a = const (9.200) 

allowing, however, that n a and np assume different values. For such state the mean occupation 
number n ra is uniform at all lattice sites, but the mean number of particles with spin a can be 
different from the mean number of particles with spin (3. Therefore, the ground state of the system 
can have a non-zero local spin and, consequently, non-vanishing magnetization. In case n a = np 
the following construction will lead to a ground state which coincides with the non-magnetic ground 
state (9.183) . 

To determine a magnetic ground state we note 

^ ] N r a = ^ ] I ^ ~] U rm U rm / I dl ncr d m ' a = ^ ] ^mm'^ma = ^ ] N ma , 
r mm' \ r / mm' m 
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from which we obtain 



The mean field Hamiltonian is then 

Hrnf = ^2 [( €m + vn ^> ^ ma + ( £m + vn °^ ^mp ~ 2N o vn an/3 ■ (9.202) 
m 

This Hamiltonian describes a system of non-interacting electrons with a spin-dependent dispersion 
law 

e m a = e m + vn- a . (9.203) 

The actual values of n a [o = a, (5) are determined by minimizing the energy density of the ground 
state 

£mf = = E I^Wl " ( 9 - 204 ) 

m,cr 

with respect to n a and ng, subject to the constraint 

n = n a + np . (9.205) 

Such minimization is realized through the method of Lagrangian multipliers. According to this 
method one minimizes 

£ m f = £ m f + f-L (n - n a - np) , (9.206) 

where ri is the Lagrangian multiplier considered an independent variable, i.e., £ m f is minimized 
with respect to n a , np and a. The additional term in (9.206) ascertains that condition (9.205) is 
met. 

At this point we exploit the fact that our system is macroscopically large, i.e., iVo ~ 10 23 . In this 
limit the discrete energy spectrum (9.203) is provided by the continuous function 

e a (k) = e(k) + vn_ a , (9.207) 

where e(k) is given by (9.192). This last equation tells us that the electrons with spin a {(3) are 
accommodated by an energy sub-band e a {k) (ep(k)) which is obtained from the dispersion law of 
the non-interactive electrons e(k) by an overall shift of vnp (vn a ). The ground state (corresponding 
to the lowest possible energy) of the many electron system is obtained by filling these two energy 
sub-bands with electrons up to the same maximum energy value, the so-called Fermi energy €f = H 
(see below), as is shown schematically in Figure 9.2a. Since ep{k) — e a {k) = v(n a — np), one can 
see that an uneven occupation by electrons of the two energy bands yields a relative shift of the 
energy bands with respect to each other, e.g., if n a > np then ep(k) > e a (k), such that the larger 
n a , the smaller is e a (k) for a given k. Therefore, one can expect the system to lower its energy by 
assuming n a ^ np and, hence, a spin-polarized ground state. 

The values of n a and np are determined by minimizing the energy density (9.206). For this purpose 
the sum in (9.206) needs to be evaluated. In the limit Ao — > oo this sum can be expressed as an 

integral. In fact, for any function / (j^j holds (compare with the definition of a definite integral 

as the limit of the corresponding Riemann sum) 

m=— Nq+1 
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(a) (b) 

Figure 9.2: (a) Relative position of the two energy sub-bands e a (k) and ep(k) for A = n a — np > 0. 
The shaded areas represent the filled portions of these bands by electrons, (b) Graphical definition 
of the limiting energies e a and ep. 



If the function / depends explicitly only on e(k) one can state 

/n jl poo 

^ n<k))= y p(e)de/(e) (9 - 2o9) 

where 

—S(e-e(k)) . (9.210) 

Here S(x) is the Dirac-delta function and p(e), which is usually called the density of states, gives 
the the number of available one particle states per site, unit energy and a given spin orientation of 
the particle. 

For a given dispersion law e(k) the density of states p(e) can be calculated by employing equation 
(9.210). In our case e(k) is given by (9.192) and, therefore, holds 



f n dk 

/ — 5{e + 2tcosk) . (9.211) 



In order to calculate the integral we recall the following property of the Dirac-delta function 

where Xi are the simple roots of the function f(x). Since the equation f(k) = 2tcos k + e = has 
two simple roots, namely k\^ = ±(vr — arccos(e/2£)), and noting 

\f'(k)\ = \2tsmk\ = 2ty/l - cos 2 k = ^(2t) 2 - e 2 , 

one obtains 

S(k + tt — arccos(e/2t)) + S(k — ir + arccos(e/2t)) 

die — elk)) = = . 

V(2t) 2 - e 2 

Inserting this last result into (9.211) and using 



/7i 



dkS(k ± (tt - arccos(e/2f))) = 0(2t 
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where 9{x) is the step function, i.e., 9{x) = 1 if x > and 9(x) = if x < 0, one arrives at the 
following expression of the density of states 

= »(*-l<l) . (9 . 213) 

ir x /(2«) 2 -e 2 

The presence of the function in the above formula guaranties that the density of states vanishes 
for |e| > 2t, i.e., outside the energy band e(k); if we assume that e is restricted to the interval 
] — 2t,2t[ then the 9 function in (9.213) can simply be replaced by 1. 

We can employ the results obtained to express the ground state energy density (9.204) through an 
integral expression. One obtains 

£ m f = / ep{e)de + / ep{e)de + vn a np + p{n -n a — np) , (9.214) 

J-2t J-2t 

where e a denotes the value of e(k) which corresponds to the top of the filled portion of the energy 
sub-band e a (k) [c.f. Fig. 9.2], i.e., 

(-F = (-a + v np = ep + vn a ■ (9.215) 

n a and np in (9.214) are given by the sum (9.201). Replacing again the sum by an integral over 
energy one can write 

n a = p(e)de (9.216) 

J-2t 

Using (9.213) and carrying out the resulting integral yields 

n a =- arcsin(e (T /2t) . (9.217) 

7T 

Since £ m f [c.f. (9.206)] depends only on continuous quantities, namely, on e a , ep and fi, the necessary 
conditions for a ground state of minimum energy are 

^k = , ^k = , ^k = . (9.218) 
de a dep dp 

The derivatives can be readily determined and the conditions (9.218) read 

d£ m f 



de, 



e a + vnp- p = , (9.219) 



= n - n a - np = . (9.221) 



on 

— ^ = ep + vn a - p = , (9.220) 
oep 

and 

d£mf 

dp 

Equations (9.219-9.221) and (9.216) allow one, in principle, to determine the unknown quantities 
e Q , ep, p, n a and np as a function of the parameters of the Hubbard model, namely, of t, v and n. 
From (9.219, 9.220) p can be readily obtained 

\ DTI 

P=-(e a + ep) + — , (9.222) 
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Comparision of (9.222) and (9.215) reveals that p is indeed equal to the Fermi energy 6f (see also 
Figure 9.2b). 

One still needs to determine e Q and ep. For this purpose we consider the magnetization of the mean 
field ground state 

A = n a - np . (9.223) 

A = corresponds to the ground state (9.183) without magnetization. One can express A as a 
function of e a and ep in two different ways. First, from (9.219-9.220) and (9.223 one obtains 

A = — ^ , (9.224) 

v 

and second, from (9.217) and (9.223) follows 

A=[ a p(e)de=[ p{ep + e)de. (9.225) 

Jep JO 

Defining the function 

/(A) = / p (ep + e)de, (9.226) 
J o 

one can obtain A by solving (cf. (9.226) 

A = /(A) . (9.227) 

Since /(0) = one can infer that A = is always a solution (the so called paramagnetic solution) 
of condition (9.227). 

Equation (9.227) can be solved graphically. For this purpose one plots /(A) versus A as is shown 
schematically in Figure 9.3. We assume in the following discussion that the implicit A dependence 
of ep in /(A) can be neglected. 

From the definition (9.226) follows that /(A) is a monotonically increasing function of A. Indeed, 
the slope of /(A) is positive for all A > 0, as reflected by the expression 

= vp(ep + A) = - > . (9.228) 

Since the total number of states per site is finite, /(A) converges to a maximum value as A — ► 
^■max = n. Accordingly, condition (9.227) will have a second, non trivial, so called ferromagnetic 
solution, if, and only if, the slope of /(A) at the origin is larger than 1 [cf. Figure 9.3]. If the slope 
of /(A) at the origin is less than 1, condition (9.227) has only the trivial solution A = 0. 
The slope of /(A) at the origin depends on the Hubbard model parameter v. For v above a critical 
value v c , the Hubbard model has a ground state with A / 0, i.e., a magnetic ground state. One 
can determine v c by equating the slope of /(A) at the origin to 1, i.e., through the condition 
[cf. (9.228)] 

v c p(ep) = l, (9.229) 

This cindition is known as the Stoner criterion and gives the critical value of v which determines 
the onset of the ferromagnetic long range order. The Stoner criterion (9.229) has a wider range of 
validity than one can infer from the present analysis of the Hubbard model. 
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Figure 9.3: Graphical solution of the mean field equation A = /(A). 
By parameterizing e a and tp 

e a = 2tsinx, ep = 2t sin y , (9.230) 
one obtains from (9.219, 9.220, 9.205) the following set of equations determining x, y and, hence, 

x + y = Tr(n-l) (9.231) 
2t 

x — y = 7T — (sin x — sin y) = it A . (9.232) 

For a given ^ the values of x and y can be obtained numerically and other relevant quantities 
mentioned above can be calculated. 

We like to determine finally the dependence of on the filling factor n. The corresponding 
analytical expression provides the phase diagram of the ground state: when v approaches v c from 
above, the magnetization A of the ground state vanishes and remains zero for values v < v c . 
Equation (9.232) states that x — y is proportional to A. Since A vanishes near v c we set x = y + 5 
where 5 is a small quantity. From (9.231) follows 

y= ^ (9-233) 

Employing the approximation sin(y + 5) — s'my rs Scosy for small 5, (9.232) together with (9.233) 
yield the desired expression 

| = 7rcos[|(n-l)] . (9.234) 

The resulting phase diagram of the ground state, i.e., the plot of ?fe vs. n, is presented in Figure 9.4. 
As already mentioned, the ground state is characterized by the quantities (v, t, n). Accordingly, any 
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mean field ground state is represented by a point in the phase diagram. The magnetic nature of the 
ground state of the system depends on whether the representative point lies inside the ferromagnetic 
or inside the paramagnetic domain of the phase diagram. 




Figure 9.4: Ground state phase diagram of the mean field Hamiltonian (9.202). 



Concluding Remarks 

At the end of our analysis of the mean field ground state of the one-dimensional Hubbard model, 
it is natural to ask oneself if the results obtained reflect, at least qualitatively, the properties of 
the real ground state of the Hubbard model. Well, the answer is no. The real ground state of 
the one-dimensional Hubbard model is quite different in nature from the mean field ground state 
discussed here. The reason is that the effect of quantum fluctuations, which are neglected in the 
mean field theory, in one spatial dimension is very strong. 

Nevertheless, the analysis presented above is not quite useless. First, once we specify a class of 
possible states to which the real ground state might belong to, the method described above gives 
a systematic way of singling out the state with the lowest possible energy which might be a good 
candidate for the real ground state of the system. Second, since in one spatial dimension there are 
many exact results available, the application of the mean field theory for these systems provides an 
excellent testing opportunity of these theories by comparing their predicted results with the exact 
ones. Third, the mean field theory of the one-dimensional Hubbard model can be extended in a 
straightforward way to higher spatial dimensions. In general, the effect of quantum fluctuations is 
getting less important as the dimensionality of the system is increased. For example, in three spatial 
dimensions the theory presented above works fine in the case of the transitional-metal oxides such 
as NiO and CoO. However, strong electron correlation effects , even in these materials, can lead to 
serious modifications of the mean field ground state. 
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In case of two spatial dimensions things are more complicated. On the one hand, the lack of exact 
solutions, and on the other hand, the strong effect of quantum fluctuations of the strongly correlated 
electron system described by the Hubbard Hamiltonian makes all the presently existing solutions 
questionable. In fact, in the case of the copper oxide high temperature superconductors, which 
involve strongly correlated quasi two-dimensional electron systems, a reliable microscopic theory is 
still lacking. The available mean field theories cannot account for all the striking, unusual physical 
properties of these materials. 
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Chapter 10 

Relativistic Quantum Mechanics 



In this Chapter we will address the issue that the laws of physics must be formulated in a form 
which is Lorentz-invariant, i.e., the description should not allow one to differentiate between frames 
of reference which are moving relative to each other with a constant uniform velocity v. The 
transformations beween such frames according to the Theory of Special Relativity are described 
by Lorentz transformations. In case that v is oriented along the izq-axis, i.e., v = vix±, these 
transformations are 

x\i = — 1 Vl = , t' = — = = , x' 2 = X2 ; x' 3 = X3 (10.1) 



which connect space time coordinates (xi,X2,xs,t) in one frame with space time coordinates 
(oc , 1 ,oc , 2 ,x , s ,tf) in another frame. Here c denotes the velocity of light. We will introduce below 
Lorentz-invariant differential equations which take the place of the Schrodinger equation of a par- 
ticle of mass m and charge q in an electromagnetic field [c.f. (refeq:ham2, 8.45)] described by an 
electrical potential V(f, t) and a vector potential A(r, t) 



ih -^ip(r, t) 



2 



The replacement of (10.2) by Lorentz-invariant equations will have two surprising and extremely 
important consequences: some of the equations need to be formulated in a representation for which 
the wave functions ip(r, t) are vectors of dimension larger one, the components representing the 
spin attribute of particles and also representing together with a particle its anti-particle. We will 
find that actually several Lorentz-invariant equations which replace (10.2) will result, any of these 
equations being specific for certain classes of particles, e.g., spin-0 particles, spin-1 particles, etc. 
As mentioned, some of the equations describe a particle together with its anti-particle. It is not 
possible to uncouple the equations to describe only a single type particle without affecting nega- 
tively the Lorentz invariance of the equations. Furthermore, the equations need to be interpreted 
as actually describing many-particle-systems: the equivalence of mass and energy in relativistic 
formulations of physics allows that energy converts into particles such that any particle described 
will have 'companions' which assume at least a virtual existence. 

Obviously, it will be necessary to begin this Chapter with an investigation of the group of Lorentz 
transformations and their representation in the space of position r and time t. The representation 
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in Sect. 10.1 will be extended in Sect. 10.4 to cover fields, i.e., wave functions ip(f,t) and vectors 
with functions ip(r, t) as components. This will provide us with a general set of Lorentz-invariant 
equations which for various particles take the place of the Schrodinger equation. Before introduc- 
ing these general Lorentz-invariant field equations we will provide in Sects. 10.5, 10.7 a heuristic 
derivation of the two most widely used and best known Lorentz-invariant field equations, namely 
the Klein-Gordon (Sect. 10.5) and the Dirac (Sect. 10.7) equation. 

10.1 Natural Representation of the Lorentz Group 

In this Section we consider the natural representation of the Lorentz group £, i.e. the group of 
Lorentz transformations (10.1). Rather than starting from (10.1), however, we will provide a more 
basic definition of the transformations. We will find that this definition will lead us back to the 
transformation law (10.1), but in a setting of representation theory methods as applied in Secti. 5 
to the groups SO(3) and SU(2) of rotation transformations of space coordinates and of spin. 
The elements L G C act on 4-dimensional vectors of position- and time-coordinates. We will 
denote these vectors as follows 

a" d = (x°,x\x 2 ,x 3 ) (10.3) 

where x° = ct describes the time coordinate and (x 1 , x 2 , x 3 ) = r describes the space coordinates. 
Note that the components of x^ all have the same dimension, namely that of length. We will, 
henceforth, assume new units for time such that the velocity of light c becomes c = 1. This choice 
implies dzm(time) = dim(length). 

Minkowski Space 

Historically, the Lorentz transformations were formulated in a space in which the time component 
of x M was chosen as a purely imaginary number and the space components real. This space is 
called the Minkowski space. The reason for this choice is that the transformations (10.1) leave the 
quantity 

s 2 = (a . 0) 2 _ (x l } 2 _ (a . 2) 2 _ (x 3 } 2 (1Q 4) 

invariant, i.e., for the transformed space-time-cordinates x'^ = (x' ' ) holds 

(x o )2 _ (x i )2 _ (x2)2 _ (a . 3)2 = {x y _ (x /i )2 _ (x / 2)2 _ (x / 3)2 (1Q 5) 

One can interprete the quantity \/—s 2 as a distance in a 4-dimensional Euclidean space if one 
chooses the time component purely imaginary. In such a space Lorentz transformations corre- 
spond to 4-dimensional rotations. Rather than following this avenue we will introduce Lorentz 
transformations within a setting which does not require real and imaginary coordinates. 

The Group of Lorentz Transformations C = 0(3,1) 

The Lorentz transformations L describe the relationship between space-time coordinates x^ of two 
reference frames which move relative to each other with uniform fixed velocity v and which might 
be reoriented relative to each other by a rotation around a common origin. Denoting by x M the 
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coordinates in one reference frame and by x' M the coordinates in the other reference frame, the 
Lorentz transformations constitute a linear transformation which we denote by 



3 

u=0 



r -- y^L^ u x u . (10.6) 



Here v are the elements of a 4 x 4— matrix representing the Lorentz transformation. The upper 
index closer to 'L' denotes the first index of the matrix and the lower index v further away from 
'L' denotes the second index. [ A more conventional notation would be L^ u , however, the latter 
notation will be used for different quantities further below.] The following possibilities exist for the 
positioning of the indices fx, v = 0, 1, 2, 3: 

4-vector: x M , x M ; 4x4 tensor: A» v , A/, A^ v , A^ . (10.7) 

The reason for the notation is two-fold. First, the notation in (10.6) allows us to introduce the 
so-called summation conventon: any time the same index appears in an upper and a lower position, 
summation over that index is assumed without explicitly noting it, i.e., 

3 3 3 

"new" ^ =0 new ^ =0 '^~nsw~~ ' ^ =0 

old old old 

The summation convention allows us to write (10.6) x" 1 = U l v x v . The second reason is that upper 
and lower positions allow us to accomodate the expression (10.5) into scalar products. This will be 
explained further below. 

The Lorentz transformations are non-singular 4 x 4-matrices with real coefficients, i.e., L £ GL(4, 
M), the latter set constituting a group. The Lorentz transformations form the subgroup of all 
matrices which leave the expression (10.5) invariant. This condition can be written 

z*Vx" = xTg^aT (10.9) 

where 



( 1 \ 
0-10 

0-10 

\ o o o -l / 



g . (10.10) 



Combining condition (10.9) and (10.6) yields 

L%g^L% X Px° = g pa xP X ° . (10.11) 
Since this holds for any x M it must be true 

L%g^ v L\ = g pa . (10.12) 



This condition specifies the key property of Lorentz transformations. We will exploit this property 
below to determine the general form of the Lorentz transformations. The subset of GL(4, R), the 
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elements of which satisfy this condition, is called 0(3,1). This set is identical with the set of all 
Lorentz transformations C We want to show now C = 0(3,1) C GL(4, M) is a group. 
To simplify the following proof of the key group properties we like to adopt the conventional matrix 
notation for V- v 



( L\ L\ L\ L° 3 \ 
L 1 q L 1 ! L X 2 L 1 ^ 



L 2 q L 2 i L 2 2 L s z 



L = (IA,) = 

\ L 3 o L 3 i L 3 2 L 3 3 / 
Using the definition (10.10) of g one can rewrite the invariance property (10.12) 



(10.13) 



L T gL 



From this one can obtain using 

(gL T g)L = IL and, hence, the inverse of L 



(10.14) 
(10.15) 



L° 2 L 1 
L 1 



L 1 = gL T g = 

V -L° 3 

The corresponding expression for (L T ) _1 is obviously 

(I/)" 1 = (L _1 ) T 
To demonstrate the group property of 0(3,1), i.e., of 

0(3,1) = { L, L G GL(4, 



I? 
L 2 



(10.16) 



gLg • 



L T gL 



(10.17) 



(10.18) 



we note first that the identity matrix 11 is an element of 0(3,1) since it satisfies (10.14). We consider 
then Li,L 2 G 0(3,1). For L 3 = L : L 2 holds 



L3 g L 3 = L^Lf g LiL 2 = L 2 (L{ gLi) L 2 = L 2 g L 2 = g , 



(10.19) 



i.e., L3 G 0(3,1). One can also show that if L G 0(3,1), the associated inverse obeys (10.14), i.e., 
L _1 G 0(3,1). In fact, employing expressions (10.16, 10.17) one obtains 



(L x ) T gL 1 = gLgggL T g = gLgL T g . 



(10.20) 



Multiplying (10.14) from the right by gL T and using (10.15) one can derive L r gLgL T = L T and 
multiplying this from the left by by g(L T ) _1 yields 

LgL T = g (10.21) 

Using this result to simplify the r.h.s. of (10.20) results in the desired property 

(L- 1 ) T gL- 1 = g, (10.22) 

i.e., property (10.14) holds for the inverse of L. This stipulates that 0(3,1) is, in fact, a group. 
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Classification of Lorentz Transformations 

We like to classify now the elements of C = 0(3,1). For this purpose we consider first the value 
of detL. A statement on this value can be made on account of property (10.14). Using det AB = 
det A&etB and det A T = det A yields (detL) 2 = 1 or 

detL = ±1 . (10.23) 

One can classify Lorentz transformations according to the value of the determinant into two distinct 
classes. 

A second class property follows from (10.14) which we employ in the formulation (10.12). Consid- 
ering in (10.12) the case p = 0, a = yields 

(L%) 2 - {L\f - (L 2 ) 2 - (L 3 ) 2 = 1 . (10.24) 

or since (L^) 2 + (L 2 ) 2 + (L 3 ) 2 > it holds (L° ) 2 > 1. From this we can conclude 

L° > 1 or L° < -1 , (10.25) 

i.e., there exist two other distinct classes. Properties (10.23) and (10.25) can be stated as follows: 
The set of all Lorentz transformations C is given as the union 

£ = C\ U £ l + U Li U £ l _ (10.26) 

where C\, 4> 4, C[_ are disjunct sets defined as follows 

4 = {L,L g 0(3,1), detL = 1, L° > 1} ; (10.27) 

C l + = {L,L G 0(3,1), detL = 1, L° < -1} ; (10.28) 

Cl = {L,L e 0(3,1), detL = -1, L° > 1} ; (10.29) 

C l _ = {L,L e 0(3,1), detL = -1, L° < -1} . (10.30) 

It holds g G C and —11 G C as one can readily verify testing for property (10.14). One can also 
verify that one can write 

Cl = g4 = 4g ; (10.31) 

4 = -4 5 (10.32) 

c l - = -g4 = -4§ ( 10 - 33 ) 

where we used the definition aM. = \M\^Mi,Mi G M.,M\ = aMj}. The above shows that 

t 

the set of proper Lorentz transformations C + allows one to generate all Lorentz transformations, 
except for the trivial factors g and —11. It is, hence, entirely suitable to investigate first only 
Lorentz transformations in C\_. 

We start our investigation by demonstrating that 4 forms a group. Obviously, 4 contains 1. 
We can also demonstrate that for A, B G C + holds C = AB G C \. For this purpose we consider 
the value of C° = A°^ = Y,%i A °j Bj o + ^°o^°o- Schwartz's inequality yields 

EAb j o) < E(^) 2 X> J °) 2 • ( 10 - 34 ) 

3=1 ) 3=1 3=1 
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From (10.12) follows (B° ) 2 - £j =1 (.B J o) 2 = 1 or £f =1 (£ j o) 2 = (B° ) 2 - 1. Similarly, one can 
conclude from (10.21) Ej=i(^°i) 2 = (^°o) 2 ~ 1- (10.34) provides then the estimate 

E^oj < [(^°o) 2 " 1] [(^°o) 2 " 1] < (^°o) 2 (^o) 2 • (10-35) 

One can conclude, therefore, | J2^=i A°jB^o\ < ^4°o-B°o. Since A°q > 1 and B°o > 1, obviously 
A°oB°o > 1. Using the above expression for C°o one can state C°o > 0. In fact, since the group 

property of 0(3,1) ascertains C T gC = g it must hold C°o > 1. 

t 

The next group property of C + to be demonstrated is the existence of the inverse. For the inverse of 
any L G holds (10.16). This relationship shows (L _1 )° = L%, from which one can conclude 
L _1 G C\. We also note that the identity operator l has elements 

!"„ = (10.36) 

where we defined 1 

= { I T'T ( 10 - 37 ) 

[ for jjL f: v 

It holds, IL o = > 1 and, hence, IL G £+. Since the associative property holds for matrix multipli- 
cation we have verified that C\ is indeed a subgroup of SO(3,l). 

t 

C + is called the subgroup of proper, orthochronous Lorentz transformations. In the following we 
will consider solely this subgroup of SO (3,1). 



Infinitesimal Lorentz transformations 

t 

The transformations in C + have the property that they are continously connected to the identity IL , 
i.e., these transformations can be parametrized such that a continuous variation of the parameters 
connects any element of C\ with IL. This property will be exploited now in that we consider first 

transformations in a small neighborhood of IL which we parametrize by infinitesimal parameters. 

t 

We will then employ the Lie group properties to generate all transformations in C + . 
Accordingly, we consider transformations 

L^ v = 5^ v + e* v ; e* v small . (10.38) 

For these transformations, obviously, holds L°o > and the value of the determinant is close to 
unity, i.e., if we enforce (10.14) actually L°o > 1 and detL = 1 must hold. Property (10.14) 
implies 

(1 + e T ) g (1 + e) = g (10.39) 
where we have employed the matrix form e defined as in (10.13). To order 0(e 2 ) holds 

e T g + ge = . (10.40) 

1 It should be noted that according to our present definition holds 5^ = g^ p & r ' '„ and, accordingly, S o = 1 and 
Sn — 822 = = — 1. 
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Using (10.15) one can conclude 



geg 



(10.41) 



which reads explicitly 



e°2 



\ e 3 

This relationship implies 



e 2 o 



( ~e°o 



6°2 



-e 2 3 



(10.42) 



= 

= e\ 



3 = 1,2,3 



eh 



e i i J> k — 1, 2, 3 

Inspection shows that the matrix e has 6 independent elements and can be written 

/ 



(10.43) 



— Wi —W2 

Wl — $3 

W2 $3 

\ —^3 — #2 #1 



-1U 3 \ 

#2 
-01 

o / 



This result allows us now to define six generators for the Lorentz transformations^ 

Jfc = e(i9fc = 1, other five parameters zero) 

Kfc = e(wk = 1, other five parameters zero) . 
The generators are explicitly 



(10.44) 

1,2,3) 
(10.45) 
(10.46) 



Ji 



/ 





\ o o i 



o \ 


-1 
o / 



/ o 




V o 



\ 

1 



-10 0/ 



/ 



1 

\ o o 



\ 

-1 



0/ 



(10.47) 



Ki = 



K 5 



K, 



-1 0\ 


0/ 

These commutators obey the following commutation relationships 



( 





-1 





o \ 




-1 

























\ 











o J 



( 

V 





-1 










-10 



-1 







(10.48) 



[Jfcj 2e] 
[K fc , K t ] 
[3k, K e ] 



(10.49) 



The operators also obey 

J K = JiJi + J 2 J 2 + J3J3 = 



(10.50) 
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as can be readily verified. 



Exercise 7.1: 

Demonstrate the commutation relationships (10.49, 10.50). 



The commutation relationships (10.49) define the Lie algebra associated with the Lie group C\. 
The commutation relationships imply that the algebra of the generators k = 1, 2, 3 is closed. 

Following the treatment of the rotation group SO(3) one can express the elements of C\_ through 
the exponential operators 



L(<?, w) 



exp ( ■& ■ J + ttJ • K 



) ; 



(10.51) 



where we have defined # ■ J = X)fc=i "^fcJfc and w; • K = ^jfc=i w k^k- One can readily show, 
following the algebra in Chapter 5, and using the relationship 





L k 



(10.52) 



where the 3 x 3-matrices are the generators of SO (3) defined in Chapter 5, that the transfor- 
mations (10.51) for w = correspond to rotations of the spatial coordinates, i.e., 



L(0, w = 0) 





R(S) 



(10.53) 



Here R($) are the 3 x 3-rotation matrices constructed in Chapter 5. For the parameters of the 
Lorentz transformations holds obviously 



tf fc £ [0,2tt[ , k = 1,2,3 



(10.54) 



which, however, constitutes an overcomplete parametrization of the rotations (see Chapter 5). 
We consider now the Lorentz transformations for $ = which are referred to as 'boosts'. A boost 
in the xi-direction is L = exp(u>iKi). To determine the explicit form of this transformation we 
evaluate the exponential operator by Taylor expansion. In analogy to equation (5.35) it issumcient 
to consider in the present case the 2 x 2-matrix 



ll 



exp w\ 







-1 




^ n! 



n=0 







(10.55) 



since 



exp(ttJiKi) = exp 
Using the idempotence property 



-1 
-1 



V 





\ o o 



1 
1 



\ 



1 
1 / 



1 



(10.56) 



(10.57) 
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one can carry out the Taylor expansion above: 

00 „2n 00 „„2ra+l 



~^(2n)! Z ^(2n + 1)!\-1 J 

n=0 v ; n=0 v ; v 7 



(10.58) 



cosh w\ 1 + sinh iui 



-1 



cosh iwi —sinh i«i 
—sinh i«i cosh w\ 



The conventional form (10.1) of the Lorentz transformations is obtained through the parameter 
change 

sinhiui 

v\ = = tanh^i (10.59) 

cosh w\ 



Using cosh 2 u>i — sinh 2 ^! = 1 one can identify sinh^i = \J cosh 2 u>i — 1 and coshtui = 
Vsinh 2 ^ + 1. Correspondingly, one obtains from (10.59) 



Vl 



\/ cosh 2 w\ — 1 
cosh w\ 



sinhu>i 



\/ sinh 2 tui + 1 



(10.60) 



These two equations yield 



cosh w\ = 1 / y 1 — v 2 ; sinh w\ = v\/yl — vf , 



(10.61) 



and (10.56, 10.59) can be written 



exp (jiiiKi 



1 



-vi 



\ 



-v\ ^vj 

— — — 1 — 



V 








■ V 








1 
1/ 



(10.62) 



According to (10.3, 10.6, 10.51) the explicit transformation for space-time-coordinates is then 



x[ 



Xl — V\t 



t' 



VlXl , 
> x 2 



x 2 , x 3 



^3 



(10.63) 



which agrees with (10.1). 

The range of the parameters Wk can now be specified, vt defined in (10.59) for the case k = 1 
corresponds to the relative velocity of two frames of reference. We expect that vt can only assume 
values less than the velocity of light c which in the present units is c = 1. Accordingly, we can 
state Vk G ] — 1, 1[. This property is, in fact, consistent with (10.59). From (10.59) follows, however, 
for Wk 

w k G ] - 00, 00 [ . (10.64) 

We note that the range of u^-values is not a compact set even though the range of v^-values is 
compact. This property of the u>fc-values contrasts with the property of the parameters $k specifying 
rotational angles which assume only values in a compact range. 
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10.2 Scalars, 4— Vectors and Tensors 

In this Section we define quantities according to their behaviour under Lorentz transformations. 
Such quantities appear in the description of physical systems and statements about transformation 
properties are often extremely helpful and usually provide important physical insight. We have 
encountered examples in connection with rotational transformations, namely, scalars like r = 
\J x\ + x\ + xf, vectors like f = (x\, X2, x^) T , spherical harmonics Y&n(r), total angular momentum 
states of composite systems like y£ m ((i, (2^1,^2) and, finally, tensor operators T^ m . Some of these 
quantities were actually defined with respect to representations of the rotation group in function 
spaces, not in the so-called natural representation associated with the 3-dimensional Euclidean 
space E 3 . 

Presently, we have not yet defined representations of Lorentz transformations beyond the 'natural' 
representation acting in the 4-dimensional space of position- and time-coordinates. Hence, our 
definition of quantities with special properties under Lorentz transformations presently is confined 
to the natural representation. Nevertheless, we will encounter an impressive example of physical 
properties. 



Scalars The quantities with the simplest transformation behaviour are so-called scalars / £ M 
which are invariant under transformations, i.e., 

/' = / . (10.65) 

An example is s 2 defined in (10.4), another example is the rest mass m of a particle. However, not 
any physical property / G K is a scalar. Counterexamples are the energy, the charge density, the 
z-component £3 of a particle, the square of the electric field \E(r, t)\ 2 or the scalar product r\ ■ r*2 
of two particle positions. We will see below how true scalars under Lorentz transformations can be 
constructed. 



4- Vectors The quantities with the transformation behaviour like that of the position-time vector 
x^ defined in (10.3) are the so-called 4-vectors a M . These quantites always come as four components 
(a , a 1 , a 2 , a 3 ) T and transform according to 

a ,M = L V . (10.66) 

Examples of 4-vectors beside x M are the momentum 4-vector 

771 771 7? 

" = (E * £ =VfTf p '=^ (la67) 

the transformation behaviour of which we will demonstrate further below. A third 4-vector is the 
so-called current vector 

= (p, J) (10.68) 

where p(f, t) and J(r, t) are the charge density and the current density, respectively, of a system of 
charges. Another example is the potential 4-vector 

= (V, A) (10.69) 

where V(r, t) and A(r, t) are the electrical and the vector potential of an electromagnetic field. The 
4-vector character of and of A^ will be demonstrated further below. 
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Scalar Product 4-vectors allow one to construct scalar quantities. If a M and are 4-vectors 
then 

a»g^b v (10.70) 
is a scalar. This property follows from (10.66) together with (10.12) 

a"Vr = L% g>xv L v a ofb° = a<>g p<J b° (10.71) 

Contravariant and Covariant 4- Vectors It is convenient to define a second class of 4-vectors. 
The respective vectors a p are associated with the 4-vectors a M , the relationship being 

a„ = g^a" = (a , -a 1 , -a 2 , -a 3 ) (10.72) 

where a v is a vector with transformation behaviour as stated in (10.66). One calls 4-vectors a M 
covariant and 4-vectors a M contravariant. Covariant 4-vectors transform like 

oV = 9^L%g^a a (10.73) 

where we defined 

9^ = 9^ ■ (10.74) 

We like to point out that from definition (10.72) of the covariant 4-vector follows a p = g^ u a v . In 
fact, one can employ the tensors g^ v and g pu to raise and lower indices of L^ v as well. We do not 
establish here the consistency of the ensuing notation. In any case one can express (10.73) 

a' p = L^a a . (10.75) 

Note that according to (10.17) L^ 7 is the transformation inverse to L° p . In fact, one can express 
[{l,- 1 ) T f u = {L- l Y^ and, accordingly, (10.17) can be written 

(L' 1 )^ = L/ • (10-76) 
The 4- Vector d p An important example of a covariant 4-vector is the differential operator 

9 » = t» = (£• v ) < ia77 > 

The transformed differential operator will be denoted by 

d' = . (10.78) 

To prove the 4-vector property of we will show that g^ v d v transforms like a contravariant 4- 
vector, i.e., g^ v d' v = L^ p g pa d a . We start from x /fl = L^ v x v . Multiplication (and summation) of 
x ,M = L» v x v by LP a g m yields, using (10.12), g av x v = ^i" 1 and gT g av = 

x v = g va L^g m x'^ . (10.79) 

This is the inverse Lorentz transformation consistent with (10.16). We have duplicated the expres- 
sion for the inverse of L^ u to obtain the correct notation in terms of covariant, i.e., lower, and 
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contravariant, i.e., upper, indices. (10.79) allows us to determine the connection between d p and 
& )X . Using the chain rule of differential calculus we obtain 

9; = fe = srv^dr = l»q v (lo.so) 

Multiplication by (and summation over u) together with g XfJ- g Pfl = 5 X P yields 

g x % = L\g™d v , (10.81) 
i.e., d p does indeed transform like a covariant vector. 

d'Alembert Operator We want to construct now a scalar differential operator. For this purpose 
we define first the contravariant differential operator 

0f = gTd,, = -v) . (10.82) 

Then the operator 

8^ = d 2 - V 2 (10.83) 

is a scalar under Lorentz transformations. In fact, this operator is equal to the d'Alembert operator 
which is known to be Lorentz-invariant. 

Proof that p^ is a 4- vector We will demonstrate now that the momentum 4-vector p^ defined 
in (10.67) transforms like (10.66). For this purpose we consider the scalar differential 



(dr) 2 = dx^dx^ = (dt) 2 - (dr) 2 (10.84) 



It holds 



from which follows 



One can write 



dr^ 2 



= 1 - (v) 2 (10.85) 

(10.86) 



dt 

d 1 d 



dr Vl - v 2 dt ' 
E m m dt 



The remaining components of p^ can be written, e.g., 



i mv 1 m 

P ~ v 7 ! - v 2 ~ V 7 ! - v 2 ^t ' ' 



One can express then the momentum vector 



10.3: Relativistic Electrodynamics 



297 



The operator m £p transforms like a scalar. Since x M transforms like a contravariant 4- vector, the 
r.h.s. of (10.89) alltogether transforms like a contravariant 4-vector, and, hence, p^ on the l.h.s. of 
(10.89) must be a 4-vector. 

The momentum 4-vector allows us to construct a scalar quantity, namely 

P% = P"W = E 2 - f (10.90) 
Evaluation of the r.h.s. yields according to (10.67) 



„o -.0 f'l> III U o . . 

E — p = - - - ^ = m 2 (10.91) 



2 2 -»2 

m m v 

1 — v 2 1 — v 

or 



= m 2 (10.92) 
which, in fact, is a scalar. We like to rewrite the last result 

E 2 = p 2 + m 2 (10.93) 

or 

£ = iyV 2 + ™ 2 • (10.94) 

In the non- relativistic limit the rest energy m is the dominant contribution to E. Expansion in ^ 
should then be rapidly convergent. One obtains 

E = ±m± f^pl + ((p!_), (10 . 95) 

2m 4m 3 \ 4m 5 J v ; 

This obviously describes the energy of a free particle with rest energy ±m, kinetic energy ±^ and 
relativistic corrections. 



10.3 Relativistic Electrodynamics 

In the following we summarize the Lorentz-invariant formulation of electrodynamics and demon- 
strate its connection to the conventional formulation as provided in Sect. 8. 



Lorentz Gauge In our previous description of the electrodynamic field we had introduced the 
scalar and vector potential V(f,t) and A(f,t), respectively, and had chosen the so-called Coulomb 
gauge (8.12), i.e., V • A = 0, for these potentials. This gauge is not Lorentz-invariant and we will 
adopt here another gauge, namely, 

dtV{r, t) + V • A(r, t) = . (10.96) 

The Lorentz-invariance of this gauge, the so-called Lorentz gauge, can be demonstrated readily 
using the 4-vector notation (10.69) for the electrodynamic potential and the 4-vector derivative 
(10.77) which allow one to express (10.96) in the form 

dpA* = 0. (10.97) 

We have proven already that is a contravariant 4-vector. If we can show that A^ defined in 
(10.69) is, in fact, a contravariant 4-vector then the l.h.s. in (10.97) and, equivalently, in (10.96) is 
a scalar and, hence, Lorentz-invariant. We will demonstrate now the 4-vector property of A^. 
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Transformation Properties of J M and 

The charge density p(r, t) and current density J(r, t) are known to obey the continuity property 

d t p(f, t) + V • J(r, t) = (10.98) 

which reflects the principle of charge conservation. This principle should hold in any frame of 
reference. Equation (10.98) can be written, using (10.77) and (10.68), 

d n J^{x^) = . (10.99) 

Since this equation must be true in any frame of reference the right hand side must vanish in all 
frames, i.e., must be a scalar. Consequently, also the l.h.s. of (10.99) must be a scalar. Since d u 
transforms like a covariant 4-vector, it follows that J M , in fact, has to transform like a contravariant 
4- vector. 

We want to derive now the differential equations which determine the 4-potential A^ in the Lorentz 
gauge (10.97) and, thereby, prove that A^ is, in fact, a 4-vector. The respective equation for A = V 
can be obtained from Eq. (8.13). Using V • dtA(f,t) = <9 t V • A(r, t) together with (10.96), i.e., 
V • A(f, t) = —dtV(f, t), one obtains 

d?V(r,t) - V 2 V(f,t) = 4irp(r,t) . (10.100) 

Similarly, one obtains for A(f, t) from (8.17) using the identity (8.18) and, according to (10.96), 
V-A{f,t) = -d t V(f,t) 

d?A(r,t) - V 2 A(f,t) = 4nJ(f,t). (10.101) 
Combining equations (10.100, 10.101), using (10.83) and (10.69), yields 

d^A v (x a ) = 4vr«r(x CT ). (10.102) 

In this equation the r.h.s. transforms like a 4-vector. The l.h.s. must transform likewise. Since 
d^d^ transforms like a scalar one can conclude that A v {x a ) must transform like a 4-vector. 



The Field Tensor 

The electric and magnetic fields can be collected into an anti-symmetric 4x4 tensor 



Alternatively, this can be stated 



? k0 



—F' 
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—B x 
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B x 





pmn 




mni j^l 





(10.103) 



(10.104) 



where e mni = e mn e is the totally anti-symmetric three-dimensional tensor defined in (5.32). 

One can readily verify, using (8.6) and (8.9), that F^ u can be expressed through the potential A^ 

in (10.69) and 8" in (10.82) as follows 



(10.105) 
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The relationships (10.103, 10.104) establishe the transformation behaviour of E(r,t) and B(r,t). 
In a new frame of reference holds 

F ^u = L » aL v pF »P (10.106) 

In case that the Lorentz transformation L^ v is given by (10.62) or, equivalently, by (10.63), one 
obtains 

/ 
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E x 





E y -v 1 B z 


B z -v±E y 








B y +viE z 
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Ey-viB z 


E z +vtB y 






B z -v 1 E y 


By+V 1 E Z 









—B x 


B x 






(10.107) 



Comparision with 



EL 



-E' x 



—B' 







E'y B' z 

By B' x 



\ E' z 



-K \ 

-B'x 
J 



(10.108) 



yields then the expressions for the transformed fields E' and B'. The results can be put into the 
more general form 



E'\\ - E\\ , E' ± 



B'\\ = B» , B' ± = 



B 



Bi 



— v 2 
v x E 



V 2 



(10.109) 
(10.110) 



where En, B» and E±, B± are, respectively, the components of the fields parallel and perpendicular 
to the velocity v which determines the Lorentz transformation. These equations show that under 
Lorentz transformations electric and magnetic fields convert into one another. 

Maxwell Equations in Lorentz-Invariant Form 

One can express the Maxwell equations in terms of the tensor F^ u in Lorentz-invariant form. Noting 

d^ v = d^A v - d^A* = d^A u - WdpA* = d^A u , (10.111) 

where we used (10.105) and (10.97), one can conclude from (10.102) 

dpF*" = 4vr J u . (10.112) 

One can readily prove that this equation is equivalent to the two inhomogeneous Maxwell equations 
(8.1, 8.2). From the definition (10.105) of the tensor F^ v one can conclude the property 

ffT F l»> + Q^ F ™ + grp*!* = o (10.113) 
which can be shown to be equivalent to the two homogeneous Maxwell equations (8.3, 8.4). 
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Lorentz Force 

One important property of the electromagnetic field is the Lorentz force acting on charged particles 
moving through the field. We want to express this force through the tensor F^ v ' . It holds for a 
particle with 4-momentum p^ as given by (10.67) and charge q 

^ = — p v F M " (10.114) 
dr m 

where d/dr is given by (10.86). We want to demonstrate now that this equation is equivalent to 
the equation of motion (8.5) where p = mv/\/l — v 2 . 

To avoid confusion we will employ in the following for the energy of the particle the notation 
£ = m/y/l — v 2 [see (10.87)] and retain the definition E for the electric field. The u, = 
component of (10.114) reads then, using (10.104), 

d 4- = ^p-E (10.115) 
dr m 



or with (10.86) 



(10116) 



From this one can conclude, employing (10.93), 

ld£ 2 I dp 2 _ . 

-— = — V- = QP' E (10.117 
2 dt 2 dt v ' 

This equation follows, however, also from the equation of motion (8.5) taking the scalar product 
with p 

p.^ = qf ;.E (10.118) 

where we exploited the fact that according to p = mvj\J\ — v 2 holds p \\ v. 
For the spatial components, e.g., for \i = 1, (10.114) reads using (10.103) 

= ±(g Ex + p y B z - p z B y ). (10.119) 
dr m 



Employing again (10.86) and (10.67), i.e., £ = m/\/l — v 2 , yields 

E x + (vxB) x ] (10.120) 



dp x 
dt 



which is the x-component of the equation of motion (8.5). We have, hence, demonstrated that 
(10.114) is, in fact, equivalent to (8.5). The term on the r.h.s. of (10.120) is referred to as the 
Lorentz force. Equation (10.114), hence, provides an alternative description of the action of the 
Lorentz force. 
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10.4 Function Space Representation of Lorentz Group 

In the following it will be required to decribe the transformation of wave functions under Lorentz 
transformations. In this section we will investigate the transformation properties of scalar functions 
ip(x^), if) € Cqo (4). For such functions holds in the transformed frame 

ip'{L^ u x v ) = V(x^) (10.121) 

which states that the function values ip'(x'^) at each point x'^ in the new frame are identical to the 
function values ^(x^) in the old frame taken at the same space-time point x^, i.e., taken at the pairs 
of points (x /fl = L fl 1/ x u , x^). We need to emphasize that (10.121) covers solely the transformation 
behaviour of scalar functions. Functions which represent 4-vectorsor other non-scalar entities, e.g., 
the charge-current density in case of Sect. 10.3 or the bi-spinor wave function of electron-positron 
pairs in Sect. 10.7, obey a different transformation law. 

We like to express now ^\x'^) in terms of the old coordinates For this purpose one replaces 
x» in (10.121) by {L~ 1 Y v x v and obtains 

0V) = iK(L- 1 )VO ■ (10-122) 

This result gives rise to the definition of the function space representation p(L^ u ) of the Lorentz 
group 

(p(ZA)V)K) = ^{{L- 1 ) \x v ) ■ (10.123) 

This definition corresponds closely to the function space representation (5.42) of SO(3). In analogy 
to the situation for SO(3) we seek an expression for p{L^ u ) in terms of an exponential operator and 
transformation parameters w, i.e., we seek an expression which corresponds to (10.51) for the 
natural representation of the Lorentz group. The resulting expression should be a generalization of 
the function space representation (5.48) of SO(3), in as far as SO(3,l) is a generalization (rotation 
+ boosts) of the group SO(3). We will denote the intended representation by 

£0, w) = p(L^($,w)) = p(e* I+ * g ) 

which we present in the form 

£(S,w) = exp(tf-J + w-K} . (10.125) 

In this expression J = (Ji, J2, J3) and K. = (/Ci, /C2, /C3) are the generators of £(i}, w) which 
correspond to the generators Jk and Kj- in (10.47), and which can be constructed following the 
procedure adopted for the function space representation of SO (3). However, in the present case we 
exclude the factor '— V [cf. (5.48) and (10.125)]. Accordingly, one can evaluate Jk as follows 

Jk = km \- \p(e^ Jk ) - H 

and K-k 

K k = lim — \p(e WkKk ) - H] 



(10.124) 



(10.126) 



(10.127) 
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One obtains 



Jl 


= X 3 d 2 


— x 2 ds ; 




= X°d! 
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x 1 d 
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= x 1 d 3 


— x 3 di ; 
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= x°d 2 
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x 2 d 


J3 


= x 2 d\ 


- x 1 d 2 ; 


/c 3 


= x°d 3 


+ 


x 3 d 



(10.128) 



which we like to demonstrate for J\ and K\. 

In order to evaluate (10.126) for J\ we consider first 



Ji 



( 1 \ 

'01 

cosi?i sin$i 

\ — sim?i cosi?i J 



(10.129) 



which yields for small t?i 



p {e dlJl ) tp(x^) = ip(x°,x 1 ,cos'd 1 x 2 + sini?ix 3 , -sini?i x 2 + cosi?i x 3 ) 
= i{j(x>*) + ^1 {x 3 8 2 - x 2 8 3 ) iP(x») + 0(i??) . 

This result, obviously, reproduces the expression for J\ in (10.128). 
One can determine similarly /Ci starting from 



(10.130) 



-w\K\ 



/ cosh^i sinhtt;i \ 

sinht^i cosh^i 

10 

\ 1 / 



(10.131) 



This yields for small w\ 



p (e WlKl ^) ^(x^) = ^(cosht^i x° + sinhwix 1 , smhw\x° + cosht^ix 1 , x 2 , 
= ^(x") + wi {x l d + x°di) Y>(x M ) + 0(wj) 



(10.132) 



and, obviously, the expression for /Ci in (10.126). 

The generators J, /C obey the same Lie algebra (10.49) as the generators of the natural represen- 
tation, i.e. 



[Jk, Je] 
[/Cfc, /Q] 

[Jk, fce] 



~ ^klrn Jm 
€k£m K-m ■ 



(10.133) 

We demonstrate this for three cases, namely [1/1,1/2] = J3, [K.i,K.2] = —Jz-, and [/i,/C2] = /C3: 

[/i,/ 2 ] 



[x 3 d 2 - x 2 d 3 , x l d 3 - x 3 di] 
[x 3 d 2 , x^s] - [x 2 d 3 , x 3 9i] 
-x 1 8 2 + x 2 d 1 = J 3 , 



(10.134) 



10.4: Function Space Representation of Lorentz Group 



303 



[/Ci, K 2 ] = [x°d! + x 1 d , x°d 2 + x 2 d ] 
= [x°d!, x 2 d ] - [x'do, x°d 2 ] 

= -x 2 di + x 1 d 2 = -J 3 , (10.135) 

[j u k 2 ] = [x 3 d 2 - x 2 d 3 , x°d 2 + x 2 d ] 
= [x 3 d 2 , x 2 d ] - [x 2 d 3 , x°d 2 ] 

= x 3 d + x°d 3 = /C 3 . (10.136) 
One-Dimensional Function Space Representation 

The exponential operator (10.125) in the case of a one-dimensional transformation of the type 

C(w 3 ) = exp(w 3 JC 3 ) , (10.137) 

where /C3 is given in (10.128), can be simplified considerably. For this purpose one expresses /C3 in 
terms of hyperbolic coordinates R, n which are connected with x°, x 3 as follows 

x° = i?coshft, x 3 = i?sinhft (10.138) 

a relationship which can also be stated 

f+vt^FW ifx°>o 

K ~ \ - ^( X °) 2 - (x3)2 ifx°<0 [ U M) 



X 3 , „ 2° 



and 

tanhO = ^ , cothft = ^ . (10.140) 

x u a;- 3 

The transformation to hyperbolic coordinates closely resembles the transformation to radial coordi- 
nates for the generators of SO(3) in the function space representation [cf. Eqs. (5.85-5.87)]. In both 
cases the radial coordinate is the quantity conserved under the transformations, i.e., \J x\ + x\ + x 2 
in the case of SO(3) and y/(x ) 2 - (x 3 ) 2 in case of transformation (10.137). 

In the following we consider solely the case x° > 0. The relationships (10.139, 10.140) allow one to 
express the derivatives do, 83 in terms of We note 

dR x° dR x° 
dx° ~ H ' dx 3 ~~R 



(10.141) 



and 



dn dn dtanhft 



= cosh 2 17 



<9x 3 <9tanhS7 <9x 3 x° 

an dn dmthn . , 2n 1 

— r = —. ^— = -sinh 2 lW. (10.142) 

dx° dcothft dx° x 3 y ' 



The chain rule yields then 



dR d dn d x° d . 2 1 d 

d ° ~ dx^dR + dx°d~n ~ rM ~ s IcJdn 

dR d dn d x 3 d , 2n l 3 ., „„, 

93 =^^ + ^^ = -7?^ + cosh ^M- (10 - 143) 
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Inserting these results into the definition of IC3 in (10.128) yields 

£3 = x°d 3 + x 3 d = ^ . (10.144) 

The action of the exponential operator (10.137) on a function f(Q) G Coo(l) is then that of a shift 
operator 

C(w 3 )f(n) = exp(^ 3 ^ /(fi) = /(n + u; 3 ) . (10.145) 

10.5 Klein— Gordon Equation 

In the following Sections we will provide a heuristic derivation of the two most widely used quan- 
tum mechanical descriptions in the relativistic regime, namely the Klein-Gordon and the Dirac 
equations. We will provide a 'derivation' of these two equations which stem from the historical de- 
velopment of relativistic quantum mechanics. The historic route to these two equations, however, 
is not very insightful, but certainly is short and, therefore, extremely useful. Further below we will 
provide a more systematic, representation theoretic treatment. 



Free Particle Case 

A quantum mechanical description of a relativistic free particle results from applying the correspon- 
dence principle, which allows one to replace classical observables by quantum mechanical operators 
acting on wave functions. In the position representation the correspondence principle states 

E =► E = - h d t 

i 

p =► p = -rV (10.146) 

which, in 4-vector notation reads 

= ifr(d t ,V) = ihd^; p» = i(d t ,-V) = ihd* . (10.147) 

Applying the correspondence principle to (10.92) one obtains the wave equation 

-h 2 d^d^{x u ) = m 2 tP(x u ) (10.148) 



or 

(ft 2 ^d M + m 2 ) i)(x") = 0. (10.149) 

where ip(x^) is a scalar, complex-valued function. The latter property implies that upon change of 
reference frame ^(x M ) transforms according to (10.121, 10.122). The partial differential equation 
(10.151) is called the Klein-Gordon equation. 

In the following we will employ so-called natural units H = c = 1. In these units the quantities 
energy, momentum, mass, (length) -1 , and (time) -1 all have the same dimension. In natural units 
the Klein-Gordon equation (10.151) reads 



(d^ + m 2 ) i;{x») = 



(10.150) 
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or 

(d 2 - V 2 + m 2 ) ^(x^) = 0. (10.151) 

One can notice immeadiately that (10.150) is invariant under Lorentz transformations. This follows 
from the fact that and m 2 are scalars, and that (as postulated) ip(x )l ) is a scalar. Under 

Lorentz transformations the free particle Klein-Gordon equation (10.150) becomes 

m 2 ) <//(x' M ) = (10.152) 
which has the same form as the Klein-Gordon equation in the original frame. 



Current 4- Vector Associated with the Klein-Gordon Equation 

As is well-known the Schrodinger equation of a free particle 

idMr,t) = -^V 2 V(r,t) (10.153) 

is associated with a conservation law for particle probability 

d tPs (f,t) + V-j s (?,t) = (10.154) 

where 

Ps (r,t) = i;*(r,t)i;(f,t) (10.155) 
describes the positive definite probability to detect a particle at position r at time t and where 

ls(r,t) = ^-[r(r,tW(f,t) ~ mt)Vr(r,t)} (10.156) 



describes the current density connected with motion of the particle probability distribution. To 
derive this conservation law one rewrites the Schrodinger equation in the form (idt — ^ V 2 )t/> = 
and considers 

Im V* [idt ~ ^ V ") ^ 
which is equivalent to (10.154). 

In order to obtain the conservation law connected with the Klein-Gordon equation (10.150) one 
considers 

Im [Y>* (d^ + m 2 ) ip] = (10.158) 

which yields 

^*<9 2 V> - MV* - V^v 2 ^ + ^v 2 v>* 

d t {ip*d t ^ - MV'*) + V • (^VV>* - ip*Vi/>) = (10.159) 



which corresponds to 
where 



9 t pKG(f,t) + V-j K G(r,t) = (10.160) 



3KG(f,t) = ^-(^*(r,t)ViP(f,t) - iP(f,t)Vr(r,t)) . (10.161) 
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This conservation law differs in one important aspect from that of the Schrodinger equation (10.154), 
namely, in that the expression for pxc 1S n °t positive definite. When the Klein-Gordon equation 
had been initially suggested this lack of positive definiteness worried physicists to a degree that the 
Klein-Gordon equation was rejected and the search for a Lorentz-invariant quantum mechanical 
wave equation continued. Today, the Klein-Gordon equation is considered as a suitable equation 
to describe spin-0 particles, for example pions. The proper interpretation of pKG(r,t), it had been 
realized later, is actually that of a charge density, not of particle probability. 



Solution of the Free Particle Klein Gordon Equation 

Solutions of the free particle Klein-Gordon equation are 

iP(x») = Ne" ip ^ = Ne i{ P°- p - E °V . (10.162) 
Inserting this into the Klein-Gordon equation (10.151) yields 

(E 2 - $1 - m 2 ) ij){r,t) = (10.163) 
which results in the expected [see (10.93] dispersion relationship connecting Eq, po, m 

El = m 2 + p 2 . (10.164) 

The corresponding energy is 

E (p , ±) = ± Vm 2 + p 2 (10.165) 

This result together with (10.162) shows that the solutions of the free particle Klein-Gordon equa- 
tion (10.150) are actually determined by p and by the choice of sign ±. We denote this by 
summarizing the solutions as follows 

(d^ + m 2 ) ip (p,\\x») = (10.166) 
MP,M^) = Ar Aipe ^- A ^(P» E (p) = Vm 2 + p 2 ,X =± 

The spectrum of the Klein-Gordon equation (10.150) is a continuum of positive energies E > m, 
corresponding to A = +, and of negative energies E < —m, corresponding to A = — . The density 
Pkg(p,^) associated with the corresponding wave functions ip (p, A|x M ) according to (10.161) and 
(10.166) is 

Pkg(pA) = \^-r o ^AW^oipAW (10-167) 

which is positive for A = + and negative for A = — . The proper interpretation of the two cases is 
that the Klein-Gordon equation describes particles as well as anti-particles; the anti- particles carry 
a charge opposite to that of the associated particles, and the density pkg(p,^) actually describes 
charge density rather than probability. 

Generating a Solution Through Lorentz Transformation 

A particle at rest, i.e., with p = 0, according to (??) is decribed by the r-independent wave 
function 

?p o (p = 0,\\x») = Ne- iXmt , A = ±. (10.168) 
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We want to demonstrate now that the wave functions for p ^ in (??) can be obtained through 
appropriate Lorentz transformation of (10.168). For this purpose we consider the wave function for 
a particle moving with momentum velocity v in the direction of the x 3 -axis. Such wave function 
should be generated by applying the Lorentz transformation in the function space representation 
(10.145) choosing ^ = sinhu> 3 . This yields, in fact, for the wave function (10.168), using (10.138) 
to replace t = x° by hyperbolic coordinates R, CI, 



= N e - iXmRcosh (n+w s ) _ (10.169) 

The addition theorem of hyperbolic functions cosh(fi + tt; 3 ) = coshfi coshu; 3 + sinhfi sinhti; 3 allows 
us to rewrite the exponent on the r.h.s. of (10.169) 

-iX ( m coshiw 3 ) ( R coshft ) - iX ( msinlra; 3 ) ( R sinMl ) . (10.170) 

The coordinate transformation (10.138) and the relationships (10.61) yield for this expression 

-iX -. m x° - iX mV x 3 . (10.171) 



Vi - v 2 Vi 



V 



One can interpret then for A = +, i.e., for positive energy solutions, 

p = -mv/y/l - v 2 (10.172) 
as the momentum of the particle relative to the moving frame and 



m I m 2 / „ m 2 v 2 



VY — 2 - V x _ , 2 = + Y—^ 2 = V^T^ = E (p) (10.173) 
as the energy [c.f. (10.166)] of the particle. In case of A = + one obtains finally 

£(™ 3 )V>o(P = 0,A = +K) = Ne i{ P x3 - Eai P>° (10.174) 

which agrees with the expression given in (10.166). In case of A = — , i.e., for negative energy 
solutions, one has to interprete 

p = mv/y/l — v 2 (10.175) 
as the momentum of the particle and one obtains 

C(w 3 )MP = 0, A = -\x») = Ne i( P x3 + E °W x0 . (10.176) 



10.6 Klein— Gordon Equation for Particles in an Electromagnetic 
Field 

We consider now the quantum mechanical wave equation for a spin-0 particle moving in an elec- 
tromagnetic field described by the 4-vector potential 

A"0r") = (V(r,t),A(r,t)); A^) = (V(r,t), -A(f,t)) (10.177) 
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Table fO.f: 

Coupling of a particle of charge q to an electromagnetic field described by the 4-vector potential 
A^ = (V, A)oxA il = (V, — A). According to the so-called minimum coupling principle the presence 
of the field is accounted for by altering energy, momenta for classical particles and the respective 
operators for quantum mechanical particles in the manner shown. See also Eq. (10.147). 



To obtain the appropriate wave equation we follow the derivation of the free particle Klein-Gordon 
equation above and apply again the correspondence principle to (10.93), albeit in a form, which 
couples a particle of charge q to an electromagnetic field described through the potential A fJL {x u ). 
According to the principle of minimal coupling [see (10.69)] one replaces the quantum mechanical 
operators, i.e., idt and — iV in (10.150), according to the rules shown in Table 10.1. For this purpose 
one writes the Klein-Gordon equation (10.150) 



(-<r(*W&) + m 2 ) ^) 



. 



According to the replacements in Table 10.1 this becomes 

- qA^idu - A„)il>(x») = m 2 ^(x") 

which can also be written 

g^Wv-im 2 )^) = 0. 
In terms of space-time derivatives this reads 



(idt - qV(r,t)) 2 iP(r,t) 



(-iV - qA(r,t)Y + m 2 



(10.178) 

(10.179) 
(10.180) 

(10.181) 



Non- Relativistic Limit of Free Particle Klein Gordon Equation 

In order to consider further the interpretation of the positive and negative energy solutions of the 
Klein-Gordon equation one can consider the non- relativistic limit. For this purpose we split-off a 
factor exp(— imt) which describes the oscillations of the wave function due to the rest energy, and 
focus on the remaining part of the wave function, i.e., we define 

if,(f,t) = e- imt ^(f,t), (10.182) 

and seek an equation for ty(r,t). We will also assume, in keeping withnthe non-relativistic limit, 
that the mass m of the particle, i.e., it's rest energy, is much larger than all other energy terms, in 
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particular, larger than \idt$> /^\ and alrger than qV, i.e., 



<< m , \qV\ « m 



(10.183) 



The term on the l.h.s. of (10.181) can then be approximated as follows: 



(idt - qVYe- tmt ^ 



-imt. 



= {idt — qV) (me 

Jl— imtiT, i imt 



-imt 



e 



id t V - qVe- imt <S>) 



-me 



—imt 
imt 



qVe- imt ^ 



e - imt id t qV* + ^y z e- imt * 



m 2 e -imty 



2mqVe- imt V 



2me- imt id t ^> 



(10.184) 



where we neglected all terms which did not contain factors m. The approximation is justified on 
the ground of the inequalities (10.183). The Klein-Gordon equation (10.181) reads then 



id t V(r,t) = 



[p- qA(f,t)\' 
2m 



+ qV(r,t) 



(10.185) 



This is, however, identical to the Schrodinger equation (10.2) of a non-relativistic spin-0 particle 
moving in an electromagnetic field. 



Pionic Atoms 

To apply the Klein-Gordon equation (10.181) to a physical system we consider pionic atoms, i.e., 
atoms in which one or more electrons are replaced by 7r~ mesons. This application demonstrates 
that the Klein-Gordon equation describes spin zero particles, e.g., spin-0 mesons. 
To 'manufacture' pionic atoms, 7r~ mesons are generated through inelastic proton-proton scattering 



p + p 



P + p + 7T + 7T 



(10.186) 



then are slowed down, filtered out of the beam and finally fall as slow pions onto elements for which 
a pionic variant is to be studied. The process of ir~ meson capture involves the so-called Auger 
effect, the binding of a negative charge (typically an electron) while at the same time a lower shell 
electron is being emitted 



7r + atom 



(atom 



+ O + e 



(10.187) 



We want to investigate in the following a description of a stationary state of a pionic atom involving 
a nucleus with charge +Ze and a ir~ meson. A stationary state of the Klein-Gordon equation is 
described by a wave function 

VK^) = ^{r)e~' ltt . (10.188) 

Inserting this into (10.181) yields (we assume now that the Klein-Gordon equation describes a 
particle with mass m n and charge — e) for qV(f,t) = —^y- and A(f,t) = 



e + 



Ze 2 



+ V 2 



<p(r) 



. 



(10.189) 
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Because of the radial symmetry of the Coulomb potential we express this equation in terms of 
spherical coordinates r, 8, <\>. The Laplacian is 

V 2 = l d 2 r r + —^dewoBde + -^rA = l d 2 r r - % . (10.190) 
r r 2 sm z 8 r 2 sm z 8 v r r 2 

With this expression and after expanding (e + ^^) 2 one obtains 

( d 2 L 2 - Z 2 e A 2eZe 2 2 2 \ , 

( ^2 -2 + — + e " ™l\ = 0. (10.191) 

The operator 1? in this equation suggests to choose a solution of the type 

<p(f) = ^Y, m (M) (10.192) 

where the functions Y( m (8,(j)) are spherical harmonics, i.e., the eigenfunctions of the operator L 2 
in (10.191) 

L 2 Y (m {8,4>) = £(£ + l)Y em (e,4>) . (10.193) 
(10.192) leads then to the ordinary differential equation 

d 2 £(£ + !) -Z 2 e i 2eZe 2 2 2 \ „ , . 

— 2 - ± L + — — + e 2 - ml J Rt(r) = . (10.194) 

Bound state solutions can be obtained readily noticing that this equation is essentially identical to 
that posed by the Coulomb problem (potential — ^-) for the Schrodinger equation 

The latter problem leads to the well-known spectrum 

m (Ze 2 ) 2 

E n = - * K ' ; n = 1, 2, ...;£ = 0, 1, ... n - 1 . (10.196) 

In this expression the number nl defined through 

n! = n - £ - 1 (10.197) 

counts the number of nodes of the wave function, i.e., this quantity definitely must be an integer. 
The similarity of (10.194) and (10.195) can be made complete if one determines A such that 

\{£){\{£) + 1) = £{£ + 1) - Z 2 e 4 . (10.198) 

The suitable choice is 

\{£) = ~\ + J(t+\) 2 ~ Z 2 e 4 (10.199) 
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and one can write (10.194) 

* _ mim±» + + ? . ^ w = . (10 . 2 „o) 

The bound state solutions of this equation should correspond to e values which can be obtained 
from (10.196) if one makes the replacement 

2 2 

E — ► 6 ~ m " , t — ► A(£) , e 2 — ► e 2 — . (10.201) 

One obtains 

,2 _ m 2 m?r Z 2 e 4 -4- 



2m, 2 (n' + \{£) + l) 2 ' (10.202) 

Solving this for e (choosing the root which renders e < m n , i.e., which corresponds to a bound 
state) yields 

n > = 0,1,... ; £ = 0,1,... . (10.203) 



e = 



V 1 + K- 



Z 2 e 4 



Using (10.197, 10.199) and defining Ekg = e results in the spectrum 

E KG (n,£, m) = n = 1,2, .. . 

£ = 0, 1, . . . ,n - 1 



4 Z 2 e 4 m = -£, -i + 1, . . . , +1 

' ' (n-^| + y(£+i) 2 -Z 2 e 4 ) 2 (10.204) 

In order to compare this result with the spectrum of the non-relativistic hydrogen-like atom we 
expand in terms of the fine structure constant e 2 to order 0(e 8 ). Introducing a = Z 2 e 4 and 
j3 = i + \ (10.204) reads 

1 n (10-205) 



i + 



(n-/3+^//3 2 -a) 2 

and one obtains the series of approximations 

1 



1 J a 

(n-/3 + V/3 2 -a) 2 



1 + 



(„-^+0(a 2 )) 2 
1 

1 



1 + £ + ^ + o(« 3 ) 



1 + 2^ + w " i& + °(« 3 
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From this results for (10.204) 

+ 0(Z 6 e 12 ). (10.207) 

Here the first term represents the rest energy, the second term the non-relativistic energy, and the 
third term gives the leading relativistic correction. The latter term agrees with observations of 
pionic atoms, however, it does not agree with observations of the hydrogen spectrum. The latter 
spectrum shows, for example, a splitting of the six n = 2,£ = 1 states into groups of two and four 
degenerate states. In order to describe electron spectra one must employ the Lorentz-invariant 
wave equation for spin-^ particles, i.e., the Dirac equation introduced below. 

It must be pointed out here that e does not denote energy, but in the present case rather the negative 
of the energy. Also, the 7r~ meson is a pseudoscalar particle, i.e., the wave function changes sign 
under reflection. 



mZ 2 e 4 mZ 4 e s 



2n 2 



2n 3 



3 

4n 



10.7 The Dirac Equation 

Historically, the Klein-Gordon equation had been rejected since it did not yield a positive-definite 
probability density, a feature which is connected with the 2nd order time derivative in this equation. 
This derivative, in turn, arises because the Klein-Gordon equation, through the correspondence 
principle, is related to the equation E 2 = m 2 + p 2 of the classical theory which involves a term 
E 2 . A more satisfactory Lorentz-invariant wave equation, i.e., one with a positive-definite density, 
would have only a first order time derivative. However, because of the equivalence of space and 
time coordinates in the Minkowski space such equation necessarily can only have then first order 
derivatives with respect to spatial coordinates. It should feature then a differential operator of the 
type V = i^d^. 

Heuristic Derivation Starting from the Klein-Gordon Equation 

An obvious starting point for a Lorentz-invariant wave equation with only a first order time deriva- 
tive is E = ±yjm 2 + p 2 . Application of the correspondence principle (10.146) leads to the wave 
equation 

idtV(r,t) = ±Vm 2 - V 2 *(r,t) . (10.208) 
These two equation can be combined 

(id t + V™ 2 - V 2 ) (id t - Vm 2 - V 2 ) *(f,t) (10.209) 

which, in fact, is identical to the two equations (10.208). Equations (10.208, 10.209), however, 
are unsatisfactory since expansion of the square root operator involves all powers of the Laplace 
operator, but not an operator 27 • V as suggested by the principle of relativity (equivalence of 
space and time). Many attempts were made by theoretical physicists to 'linearize' the square root 
operator in (10.208, 10.209), but for a long time to no avail. Finally, Dirac succeeded. His solution 
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to the problem involved an ingenious step, namely, the realization that the linearization can be 
carried out only if one assumes a 4-dimensional representation of the coefficients 7^. 
Initially, it was assumed that the 4-dimensional space introduced by Dirac could be linked to 4- 
vectors, i.e., quantities with the transformation law (10.66). However, this was not so. Instead, 
the 4-dimensionsional representation discovered by Dirac involved new physical properties, spin-| 
and anti-particles. The discovery by Dirac, achieved through a beautiful mathematical theory, 
strengthens the believe of many theoretical physicists today that the properties of physical matter 
ultimately derive from a, yet to be discovered, beautiful mathematical theory and that, therefore, 
one route to important discoveries in physics is the creation of new mathematical descriptions of 
nature, these descriptions ultimately merging with the true theory of matter. 

Properties of the Dirac Matrices 

Let us now trace Dirac's steps in achieving the linearization of the 'square root operator' in (10.208). 

Starting point is to boldly factorize, according to (10.209), the operator of the Klein-Gordon 
equation 

+ m 2 = -(P + m)(P - m) (10.210) 

where 

P = i^d^ . (10.211) 

Obviously, this would lead to the two wave equations {P — m)^f = and (P + m)^f = which 
have a first order time derivative and, therefore, are associated with a positive-definite particle 
density. We seek to identify the coefficients 7^. Inserting (10.211) into (10.210) yields 

-grd„a v - m 2 = (i-fd^ + m)(i^d„ - m) 

= -yvaA - ™ 2 = -^(W^A + VVH) - ™ 2 

= -^Y + V/j^-m 2 (10.212) 

where we have changed 'dummy' summation indices, exploited d^d u = d u d^, but did not commute 
the, so far, unspecified algebraic objects 7^ and 7". Comparing the left-most and the right-most 
side of the equations above one can conclude the following property of 7^ 

7V + tV = [7^,71+ = 2^" (10.213) 

We want to determine now the simplest algebraic realization of 7 M . It turns out that no 4- vector of 
real or complex coefficients can satisfy these conditions. In fact, the quantities 7 , 7 1 , 7 2 , 7 s can 
only be realized by d x ci-matrices requiring that the wave function fy(x^) is actually a ^-dimensional 
vector of functions ip^x 1 *), ip2(x' 1 ), . . . ipd(x^). 
For ix = v condition (10.213) reads 

= { : : 1 2 ,3 ■ < io - 2 "> 

From this follows that 7 has real eigenvalues ±1 and 7^, j = 1,2,3 has imaginary eigenvalues ±i. 
Accordingly, one can impose the condition 

7° is hermitian ; 7^ , j = 1,2,3 are anti-hermitian . (10.215) 
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For n + v (10.213) reads 

-f-f = - 7 Y , (10.216) 
i.e., the 7 M are anti-commuting. From this one can conclude for the determinants of 7^ 

det(yy) = det(- 7 V) = (-l) d det( 7 V) = (-l) d det(7V) • (10.217) 

Obviously, as long as det(7 M ) / the dimension d of the square matrices 7^ must be even so that 
(-1)" = 1. 

For d = 2 there exist only three anti-commuting matrices, namely the Pauli matrices a 1 , a 2 , er 3 for 
which, in fact, holds 

(a j ) 2 = H; a j a k = -a k a j for j / k . (10.218) 

The Pauli matrices allow one, however, to construct four matrices 7^ for the next possible dimension 
d = 4. A proper choice is 

■f- ( s a ) ■■ * - ( 1 ) • <™» ) 

Using property (10.218) of the Pauli matrices one can readily prove that condition (10.213) is 
satisfied. We will argue further below that the choice f 7^, except for similarity trasnformations, is 
unique. 

The Dirac Equation 

Altogether we have shown that the Klein-Gordon equation can be factorized formally 

{i^d^ + m) (i'fd^ - m) #(a^) = (10.220) 

where ^(x^) represents a 4-dimensional wave function, rather than a scalar wave function. From 
this equation one can conclude that also the following should hold 

(*7 M <9 M - m) ty^) =0 (10.221) 

which is the celebrated Dirac equation. 

The Adjoint Dirac Equation 

The adjoint equation is 

¥(x^) ^(Y*)^ +m^j = (10.222) 

where we have defined ^ = (ipf, ip^, ^4) and where denotes the differential operator 
operating to the left side, rather than to the right side. One can readily show using the hermitian 
property of the Pauli matrices (7°)^ = 7 and (7^)^ = — 7 J for j = 1,2,3 which, in fact, is implied 
by (10.215). This property can also be written 

( 7 M)t = 7 0yy _ (10.223) 
Inserting this into (10.222) and multiplication from the right by 7 yields the adjoint Dirac equation 

^0^)7° ^7 M <J +m^j = 0. (10.224) 
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Similarity Transformations of the Dirac Equation - Chiral Representation 

The Dirac equation can be subject to any similarity transformation denned through a non-singular 
4 x 4-matrix S. Defining a new representation of the wave function 

vfr(a^) = (10.225) 

leads to the 'new' Dirac equation 

(i^d^ - m) §(x^) = (10.226) 

where 

^ = S^S- 1 (10.227) 

A representation often adopted beside the one given by (10.222, 10.219) is the socalled chiral 
representation defined through 

¥(*") = S¥(s") ; S = ±= ( I _\ ) (10.228) 



and 



V* ! I I) ; f = (° aJ ~o 1 .i-^. (.0.229) 



The similarity transformation (10.227) leaves the algebra of the Dirac matrices unaffected and 
commutation property (10.213) still holds, i.e., 

= 2gT . (10.230) 



Exercise 10.7.1: Derive (refeq:Dirac-intro20a) from (10.213), (10.227). 



Schrodinger Form of the Dirac Equation 

Another form in which the Dirac equation is used often results from multiplying (10.221) from the 
left by 7° 

(ify + ia-V - /3m) tf(r,t) = (10.231) 
wherea has the three components oij, j = 1, 2, 3 and 

f>= ( I ; &, = ( ° ,)- 1,2,3. (10.232) 

This form of the Dirac equation is called the Schrodinger form since it can be written in analogy 
to the time-dependent Schrodinger equation 

id t ^{x^) = H ^(x^); H = a ■ + (3 m. (10.233) 

The eigenstates and eigenvalues of TL correspond to the stationary states and energies of the particles 
described by the Dirac equation. 
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Clifford Algebra and Dirac Matrices 

The matrices defined through 

dj = i~i 3 ,3 = 1,2,3 ; d 4 = 7° (10.234) 
satisfy the anti-commutation property 

f 2 for i = k 

djdk + d k d 3 = | Q l J- + k (10.235) 

as can be readily verified from (10.213). The associative algebra generated by d\ . . . d 4 is called a 
Clifford algebra C 4 . The three Pauli matrices also obey the property (10.235) and, hence, form a 
Clifford algebra C 3 . The representations of Clifford algebras C m are well established. For example, 
in case of C4, a representation of the d/s is 



ioy^V 01 / v -1 / ^ V 1 

where '<8>' denotes the Kronecker product between matrices, i.e., the matrix elements of C = A®B 
are Cjk/ m = AjiB km . 

The Clifford algebra C 4 entails a subgroup G4 of elements 

± d h d h ■ ■ ■ d js , h < j 2 < ■ ■ ■ < j s s < 4 (10.237) 

which are the ordered products of the operators ±1 and d±, d 2 , d^, d^. Obviously, any product 
of the dj's can be brought to the form (10.237) by means of the property (10.235). There are 
(including the different signs) 32 elements in G4 which we define as follows 

r±i = ±1 

r ±2 = ±di , r ±3 = ±d 2 , r ±4 = ±d 3 , r ±5 = ±d 4 

r ±6 = ±did 2 , r ±7 = ±dtd 3 , r ±8 = ±d 1 d 4 , r ±9 = ±^2^3 

r±i = ±6^4, r ± n = ±d 3 d 4 

T±i2 = ±did 2 d 3 , r±i3 = ±d 1 d 2 d 4 , r ± i 4 = ±d\d 3 di , T± 15 = ±d 2 d 3 d 4 

r±i 6 = ±d 1 d 2 d 3 d 4 (10.238) 

These elements form a group since obviously any product of two T r 's can be expressed in terms 
of a third Y r . The representations of this group are given by a set of 32 4 x 4-matrices which 
are equivalent with respect to similarity transformations. Since the Tj are hermitian the similarity 
transformations are actually given in terms of unitary transformations. One can conclude then that 
also any set of 4 x 4-matrices obeying (10.235) can differ only with respect to unitary similarity 
transformations. This property extends then to 4 x 4-matrices which obey (10.213), i.e., to Dirac 
matrices. 

To complete the proof in this section the reader may consult Miller 'Symmetry Groups and their 
Application' Chapter 9.6 and R.H.Good, Rev.Mod.Phys. 27, (1955), page 187. The reader may 
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also want to establish the unitary transformation which connects the Dirac matrices in the form 
(10.236) with the Dirac representation (10.219). 



Exercise 7.2: 

Demonstrate the anti-commutation relationships (10.218) of the Pauli matrices aK 
Exercise 7.3: 

Demonstrate the anti-commutation relationships (10.218) of the Dirac matrices 7 M . 
Exercise 7.4: 

Show that from (10.214) follows that 7 has real eigenvalues ±1 and can be represented by a 
hermitian matrix, and j 3 , j = 1,2,3 has imaginary eigenvalues ±i and can be represented by an 
anti-hermitian matrix. 



10.8 Lorentz Invariance of the Dirac Equation 

We want to show now that the Dirac equation is invariant under Lorentz transformations, i.e., the 
form of the Dirac equation is identical in equivalent frames of reference, i.e., in frames connected by 
Lorentz transformations. The latter transformations imply that coordinates transform according 
to (10.6), i.e., x' = LV- u x v , and derivatives according to (10.80). Multiplication and summation 
of (10.80) by L^ p and using (10.76) yields d p = L u p d' u , a result one could have also obtained by 
applying the chain rule to (10.6). We can, therefore, transform coordinates and derivatives of the 
Dirac equation. However, we do not know yet how to transform the 4-dimensional wave function 
and the Dirac matrices 7 M . 

Lorentz Transformation of the Bispinor State 

Actually, we will approach the proof of the Lorentz invariance of the Dirac equation by testing 
if there exists a transformation of the bispinor wave function and of the Dirac matrices 7^ 
which together with the transformations of coordinates and derivatives leaves the form of the Dirac 
equation invariant, i.e., in a moving frame should hold 

- m) ^'(x m ) = 0. (10.239) 

Form invariance implies that the matrices 7" 1 should have the same properties as 7^, namely, 
(10.213, 10.215). Except for a similarity transformation, these properties determine the matrices 
7 /M uniquely, i.e., it must hold 7'^ = 7^. Hence, in a moving frame holds 

{i^d'p - m) ^'(x" 1 ) = 0. (10.240) 
Infinitesimal Bispinor State Transformation 

We want to show now that a suitable transformation of ^(x M ) does, in fact, exist. The transfor- 
mation is assumed to be linear and of the form 



^V M ) = S(L"„) ¥(z") ; x'" = L» v x v 



(10.241) 
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where S(L^ U ) is a non-singular 4 x 4-matrix, the coefficients of which depend on the matrix V" v 
defining the Lorentz transformation in such a way that S{L^ V ) = IL for L^ u = b^ v holds. Ob- 
viously, the transformation (10.241) implies a similarity transformation S^/^S -1 . One can, hence, 
state that the Dirac equation (10.221) upon Lorentz transformation yields 

(iS(L r > ( )'fS- 1 (L r >t)L'' lt & v - m) ^'(x" 1 ) = 0. (10.242) 

The form invariance of the Dirac equation under this transformation implies then the condition 

S{L\)^S~\L\)L\ = 7 ". (10.243) 

We want to determine now the 4 x 4-matrix S^L 11 ^) which satisfies this condition. 

The proper starting point for a constructiuon of ^(L^) is actually (10.243) in a form in which the 

Lorentz transformation in the form is on the r.h.s. of the equation. For this purpose we exploit 

(10.12) in the form L v p g UCT L a p = g pp = g ppi . Multiplication of (10.243) from the left by L a p g au 

yields 

S{L\)rS-\L\)g m = L\g Gvlu . (10.244) 
from which, using g pil pf ,x = 7 P , follows 

S(L\) 7p S~\L\) = L% la . (10.245) 

One can finally conclude multiplying both sides by g m 

5(L\) 7 ^- 1 (L\) = L vti lv . (10.246) 

The construction of S^LP^) will proceed using the avenue of infinitesimal transformations. We had 
introduced in (10.38) the infinitesimal Lorentz transformations in the form L^ v = 5^ u + e^„ where 
the infinitesimal operator e^ u obeyed e T = — geg. Multiplication of this property by g from the 
right yields (eg) T = —eg, i.e., eg is an anti-symmetric matrix. The elements of eg are, however, 
e n ^pv _ e iiv anc ^ hence, in the expression of the infinitesimal transformation 

L^ u = g^ + e"" (10.247) 

the infinitesimal matrix e^ v is anti-symmetric. 

The infinitesimal transformation S{L p a ) which corresponds to (10.247) can be expanded 

S{fT) = 1 - \°^ v (10.248) 

Here a^ u denote 4 x 4-matrices operating in the 4-dimensional space of the wave functions ^. 
S(e fiu ) should not change its value if one replaces in its argument e^ u by — e Ufl . It holds then 

5(0 = 1 - \°^ v = S(~e u n = 1 + (10.249) 
from which we can conclude a^e^ = —(J piV e Uil = —(J vp e uv , i.e., it must hold 

<V = • (10.250) 
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One can readily show expanding SS = 1 to first order in e^ u that for the inverse infinitesimal 
transformation holds 

S~\e^) = TL+'-a^ (10.251) 
Inserting (10.248, 10.251) into (10.246) results then in a condition for the generators <r^ u 

-\ (<T a p-f - r<r ap ) e * 3 = ?* lv . (10.252) 

Since six of the coefficients e a @ can be chosen independently, this condition can actually be expressed 
through six independent conditions. For this purpose one needs to express formally the r.h.s. of 
(10.252) also as a sum over both indices of e a/3 . Furthermore, the expression on the r.h.s., like the 
expression on the l.h.s., must be symmetric with respect to interchange of the indices a and (3. For 
this purpose we express 

= \<? P (S^a ~ ^ al p) ■ (10.253) 

Comparing this with the l.h.s. of (10.252) results in the condition for each a, (3 

7"]_ = 2z {8^ la - S^-yp) . (10.254) 

The proper a a p must be anti-symmetric in the indices a, (5 and operate in the same space as the 
Dirac matrices. In fact, a solution of condition (10.254) is 

CTaf) = \ [7«,7/?]_ (10.255) 
which can be demonstrated using the properties (10.213, 10.216) of the Dirac matrices. 



Exercise 7.5: 

Show that the a a (3 defined through (10.255) satisfy condition (10.254). 



Algebra of Generators of Bispinor Transformation 

We want to construct the bispinor Lorentz transformation by exponentiating the generators a^v 
For this purpose we need to verify that the algebra of the generators involving addition and mul- 
tiplication is closed. For this purpose we inspect the properties of the generators in a particular 
representation, namely, the chiral representation introduced above in Eqs. (10.228, 10.229). In this 
representation the Dirac matrices 7^ = (7 , —7) are 

70 = [ \ J) ; 13= ) >i = W- ( 10 - 256 ) 

One can readily verify that the non-vanishing generators a^ v are given by (note a^ u = — aVuj i.e. 
only six generators need to be determined) 

°0j = I [7o,7j] = ( 1 q £j ) ; = [7j,7fe] = tjkt( q ^ ) ■ (10.257) 
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Obviously, the algebra of these generators is closed under addition and multiplication, since both 
operations convert block-diagonal operators 

(os) 

again into block-diagonal operators, and since the algebra of the Pauli matrices is closed. 
We can finally note that the closedness of the algebra of the generators a^ u is not affected by 
similarity transformations and that, therefore, any representation of the generators, in particular, 
the representation (10.255) yields a closed algebra. 



(10.258) 



Finite Bispinor Transformation 

The closedness of the algebra of the generators defined through (10.248) allows us to write the 
transformation S for any, i.e., not necessarily infinitesimal, e^ u in the exponential form 



exp 



(10.259) 



We had stated before that the transformation S is actually determined through the Lorentz trans- 
formation L^ y . One should, therefore, be able to state S in terms of the same parameters w and 
$ as the Lorentz transformation in (10.51). In fact, one can express the tensor e^ v through w and 
■d using e^ v = e^ p g pu and the expression (10.44) 
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(10.260) 



Inserting this into (10.259) yields the desired connection between the Lorentz transformation (10.51) 
and S. 

In order to construct an explicit expression of S in terms of w and ^ we employ again the chiral 
representation. In this representation holds 



- iS) ■ a \ 

-(w + i-d) -a ) 




(10.261) 



We note that this operator is block-diagonal. Since such operator does not change its block-diagonal 
form upon exponentiation the bispinor transformation (10.259) becomes in the chiral representation 



S(w, 0) = 








(10.262) 



This expression allows one to transform according to (10.241) bispinor wave functions from one 
frame of reference into another frame of reference. 
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Current 4- Vector Associated with Dirac Equation 

We like to derive now an expression for the current 4-vector j M associated with the Dirac equation 
which satisfies the conservation law 

d^f = . (10.263) 

Starting point are the Dirac equation in the form (10.221) and the adjoint Dirac equation (10.224). 
Multiplying (10.221) from the left by ^(a^ , (10.224) from the right by ^(x^), and addition 
yields 

¥(x^)~f° [i'fdn + i'fd^j #(x M ) = 0. (10.264) 
The last result can be written 

d li &(x v )'f'fV(x v ) = 0, (10.265) 
i.e., the conservation law (10.263) does hold, in fact, for 

f{x?) = (p, j) = ^t( x M) 7 0y^(^) _ (10.266) 

The time-like component p of j M 

4 

p( x ^) = ^t( x M)^( x M) = ^|^ S ( X M)|2 (10.267) 

s=l 

has the desired property of being positive definite. 

The conservation law (10.263) allows one to conclude that must transform like a contravariant 
4-vector as the notation implies. The reason is that the r.h.s. of (10.263) obviously is a scalar 
under Lorentz transformations and that the left hand side must then also transform like a scalar. 
Since transforms like a covariant 4-vector, must transform like a contravariant 4-vector. This 
transformation behaviour can also be deduced from the transformation properties of the bispinor 
wave function ^(a^). For this purpose we prove first the relationship 

S- 1 = 7 °5 t 7° . (10.268) 

We will prove this property in the chiral representation. Obviously, the property applies then in 
any representation of S. 
For our proof we note first 

S-\^) = = [ q eW+iS) , a J (10.269) 

One can readily show that the same operator is obtained evaluating 



7 



o , 



In «T US : ) ■ do.™) 



1 J \ e -\(w-vd)-a M 1 



We conclude that (10.268) holds for the bispinor Lorentz transformation. 
We will now determine the relationship between the flux 



j = tf'V hV*V) (10.271) 



322 



Relativistic Quantum Mechanics 



in a moving frame of reference and the flux j M in a frame at rest. Note that we have assumed 
in (10.271) that the Dirac matrices are independent of the frame of reference. One obtains using 
(10.268) 

/" = tft^styy^^) = tf( a J*)7°S- 1 'fSV(xi i ) . (10.272) 
With = ^((L- 1 )^) one can restate (10.246) 

S-\L\)^L\) = (L- 1 )^ = (IT^V = L\ lv . (10.273) 

where we have employed (10.76). Combining this with (10.272) results in the expected transfor- 
mation behaviour 

= L^f . (10.274) 

10.9 Solutions of the Free Particle Dirac Equation 

We want to determine now the wave functions of free particles described by the Dirac equation. 
Like in non-relativistic quantum mechanics the free particle wave function plays a central role, not 
only as the most simple demonstration of the theory, but also as providing a basis in which the wave 
functions of interacting particle systems can be expanded and characterized. The solutions provide 
also a complete, orthonormal basis and allows one to quantize the Dirac field \&(x M ) much like the 
classical electromagnetic field is quantized through creation and annihilation operators representing 
free electromagnetic waves of fixed momentum and frequency. 

In case of non-relativistic quantum mechanics the free particle wave function has a single component 
ip(r,t) and is determined through the momentum p £ M 3 . In relativistic quantum mechanics a 
Dirac particle can also be characterized through a momentum, however, the wave function has four 
components which invite further characterization of the free particle state. In the following we 
want to provide this characterization, specific for the Dirac free particle. 
We will start from the Dirac equation in the Schrodinger form (10.231, 10.232, 10.233) 

id t ^(x^) = n o ^(x^) . (10.275) 

The free particle wave function is an eigenfunction of 7i , a property which leads to the energy- 
momentum (dispersion) relationship of the Dirac particle. The additional degrees of freedom de- 
scribed by the four components of the bispinor wave function require, as just mentioned, additional 
characterizations, i.e., the identification of observables and their quantum mechanical operators, of 
which the wave functions are eigenfunctions as well. As it turns out, only two degrees of freedom 
of the bispinor four degrees of freedom are independent [c.f. (10.282, 10.283)]. The independent 
degrees of freedom allow one to choose the states of the free Dirac particle as eigenstates of the 
4-momentum operator p^ and of the helicity operator T ~ a -p/\p\ introduced below. These opera- 
tors, as is required for the mentioned property, commute with each other. The operators commute 
also with n in (10.233). 

Like for the free particle wave functions of the non-relativistic Schrodinger and the Klein-Gordon 
equations one expects that the space-time dependence is governed by a factor exp[i(p • f — et)]. 
As pointed out, the Dirac particles are described by 4-dimensional, bispinor wave functions and we 
need to determine corresponding components of the wave function. For this purpose we consider 



10.9: Solutions of the Free Particle Dirac Equation 



323 



the following form of the free Dirac particle wave function 

•<*-) = ( j$ ) = (*:) < ia276) 

where p and e together represent four real constants, later to be identified with momentum and 
energy, and <j> , Xo each represent a constant, two-dimensional spinor state. Inserting (10.276) into 
(10.231, 10.232) leads to the 4-dimensional eigenvalue problem 



m a • p 
a ■ p -m J \ Xo J \ Xo 



(10.277) 



To solve this problem we write (10.277) explicitly 



(e - m) 1 4> - a ■ p Xo =0 

-a-p4> + (e + m) TL X o = 0. (10.278) 

Multiplication of the 1st equation by (e +m)IL and of the second equation by — a-p and subtraction 
of the results yields the 2-dimensional equation 

[(e 2 - m 2 )l - (a-p) 2 ](f> = 0. (10.279) 

According to the property (5.234) of Pauli matrices holds (a -p) 2 = p 2 IL. One can, hence, conclude 
from (10.279) the well-known relativistic dispersion relationship 

e 2 = m 2 + p 2 (10.280) 

which has a positive and a negative solution 

e = ±E(j>), E{p) = yjm 2 + p 2 . (10.281) 

Obviously, the Dirac equation, like the Klein-Gordon equation, reproduce the classical relativistic 
energy-momentum relationships (10.93, 10.94) 

Equation (10.278) provides us with information about the components of the bispinor wave function 
(10.276), namely (f> and Xo are related as follows 

00 = ^^Xo (10.282) 
e — m 

Xo = -^-<h, (10.283) 
e + m 

where e is defined in (10.281). These two relationships are consistent with each other. In fact, one 
finds using (5.234) and (10.280) 

~ ( ^ X0 = ^X0 = X0. (10.284) 



e + m (e + m) (e + m) e 2 — m 2 

The relationships (10.282, 10.283) imply that the bispinor part of the wave function allows only 
two degrees of freedom to be chosen independently. We want to show now that these degrees of 
freedom correspond to a spin-like property, the socalled helicity of the particle. 
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Xo = [ I = u £ C , ir u = \ Ul \ 2 + M = 1 • (10.288) 

1 U2 



For our further characterization we will deal with the positive and negative energy solutions [cf. 
(10.281)] separately. For the positive energy solution, i.e., the solution for e = +E(p), we present 
(f>o through the normalized vector 

^ } = u G C 2 , u f u = \ Ul \ 2 + \u 2 \ 2 = 1 . (10.285) 

The corresponding free Dirac particle is then described through the wave function 

*(£+K) = N+{p) ( ] e 1 ^-^ , e = +E(p) . (10.286) 

V E(p) + m U J 

Here A/+ (p) is a constant which will be chosen to satisfy the normalization condition 

7° = 1, (10.287) 

the form of which will be justified further below. Similarly, we present the negative energy solution, 
i.e., the solution for e = —E(p), through Xo given by 

n l 1 ^ ml t I 1 2 i i 1 2 

«2 / 

corresponding to the wave function 

*(p,-|^) = A/"_(£) ^ ^ + m U ^ e ^- rf ) , e = -£?(p) . (10.289) 

Here jV_ (p) is a constant which will be chosen to satisfy the normalization condition 

* t (p,+) 7 °*t(p,+) = -1 , (10.290) 

which differs from the normalization condition (10.287) in the minus sign on the r.h.s. The form 
of this condition and of (10.287) will be justified now. 

First, we demonstrate that the product ^(p, ±)j°^f(p, ±), i.e., the l.h.s. of (10.287, 10.290), is 
invariant under Lorentz transformations. One can see this as follows: Let ^(p, ±) denote the 
solution of a free particle moving with momentum p in the laboratory frame, and let ^(0, ±) 
denote the corresponding solution of a particle in its rest frame. The connection between the 
solutions, according to (10.241), is ^(p, ±) = S'^'(0,±) , where S is given by (10.259). Hence, 

*t ( £±) /*(£+) = (V(o,±)st) 7 °<?*(o,±) 

= * f (0, ±) ( 7 ° S- 1 7°) 7 ° S (0, ±) 

= * t (0,±)7°*(0,±). (10.291) 

Note that we have used that, according to (10.268), S" -1 = 7°S r '~7 and, hence, = 7°S~ 1 7°. 
We want to demonstrate now that the signs on the r.h.s. of (10.287, 10.290) should differ. For this 
purpose we consider first the positive energy solution. Employing (10.286) for p = yields, using 
7° as given in (10.219) and u+u = 1 [cf. (10.285)], 

*t(0,+) 7 °*(0,+) = |AA + (0)| 2 (ut, 0)7° ( o ) = |AA+(0)| 2 . (10.292) 
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The same calculation for the negative energy wave function as given in (10.289) yields 

*t (0j _) 7 Otf(o,-) = |AA_(0)| 2 (0, ut) 7 ° ^ ^ = _ |A/ -_ (0)|2 (102Q3) 

Obviously, this requires the choice of a negative side on the r.h.s. of (10.290) to assign a positive 
value to \Af-(0)\ 2 . We can also conclude from our derivation 



A4(0) = 1 



(10.294) 



We want to determine now N±{p) for arbitrary p. We consider first the positive energy solution. 
Condition (10.287) written explicitly using (10.286) is 



um ( U *) 



a ■ p 



E(p) + m 



u 



u 

d-p 



E(p)+m 



U 



(10.295) 



Evaluating the l.h.s. using 7 as given in (10.219) yields 

(a-p) 2 



MUp) 



u^u — 



(E(p) + m) 2 



u 



= 1 . 



(10.296) 



Replacing (<? • p) 2 by p 2 [c.f. (5.234)] and using the normalization of u in (10.285) results in 



from which follows 



M + {P) 



1 - 



p 2 



{E{p) + m) 2 



= 1 



(m + .E(p)) 2 



Y ( m + £?(p) ) 2 - p 2 ' 
Noting 

(m + E{p)) 2 - p 2 = m 2 - p 2 + 2mE(p) + £ 2 (p) = 2(m + E{p))m 
the normalization coefficient (10.298) becomes 



M + {p) 



2m 



(10.297) 



(10.298) 



(10.299) 



(10.300) 



This result completes the expression for the wave function (10.286). 



Exercise 7.6: Show that the normalization condition 

1 7^ 



yields the normalization coefficient 



a • p 



E{p) + m 



u 



u 



u 



= 1 



(10.301) 



m + E(p) 



(10.302) 
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We consider now the negative energy solution. Condition (10.290) written explicitly using (10.289) 
is 



Ml® y 

Evaluating the l.h.s. yields 



-a ■ p 



E{p) + m 



u 



, K) J 



r 



— cr-p 
E(p) + m 

U 



U 



1 



u 



(E(p) + m) 2 



u 



u^u 



(10.303) 



(10.304) 



This condition is, however, identical to the condition (10.296) for the normalization constant N+{p) 
of the positive energy solution. We can, hence, conclude 



m + E(p) 



2m 



(10.305) 



and, thereby, have completed the determination for the wave function (10.289). 

The wave functions (10.286, 10.289, 10.300) have been constructed to satisfy the free particle Dirac 

equation (10.275). Inserting (10.286) into (10.275) yields 



i.e., the wave functions constructed represent eigenstates of TC . 
eigenstates of the momentum operator id^, i.e., 

id^ip, Ala^) = p^^{p,X\x^) 



(10.306) 

The wave functions are also 
(10.307) 



where p^ = (e, —p). This can be verified expressing the space-time factor of *&(p, X\x^) in 4- vector 
notation, i.e., exp[z(p- f — et)] = exp(ip^x M ). 



Helicity 

The free Dirac particle wave functions (10.286, 10.289) are not completely specified, the two com- 
ponents of u indicate another degree of freedom which needs to be defined. This degree of freedom 
describes a spin-^ attribute. This attribute is the so-called helicity, defined as the component of 
the particle spin along the direction of motion. The corresponding operator which measures this 
observable is 



A 



1 p 

2 a m 



(10.308) 



Note that p represents here an operator, not a constant vector. Rather than considering the 
observable (10.307) we investigate first the observable due to the simpler operator a • p. We 
want to show that this operator commutes with 7i and p to ascertain that the free particle wave 
function can be simultaneously an eigenvector of all three operators. The commutation property 
[a-p,pj] = 0,j = 1,2,3 is fairly obvious. The property [a ■ p,H ] = follows from (10.233) and 
from the two identities 



11 









a 
a 



■p 



a 
a 



■P 
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(10.309) 
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and 



3 
3 



i ( 3 
■ p ( ff 



3 
3 



■ P 



3 
3 



{3-p? 
{3-p? 



(3-3) 2 
{3-3? 



P 



. 



(10.310) 



We have shown altogether that the operators p, H and 3-p commute with each other and, hence, can 
be simultaneously diagonal. States which are simultaneously eigenvectors of these three operators 
are also simulteneously eigenvectors of the three operators p, 7i and A defined in (10.308) above. 
The condition that the wave functions (10.286) are eigenfunctions of A as well will specify now the 
vectors u. 

Since helicity is defined relative to the direction of motion of a particle the characterization of u 
as an eigenvector of the helicity operator, in principle, is independent of the direction of motion 
of the particle. We consider first the simplest case that particles move along the ^-direction, i.e., 
p = (0,0, p 3 ). In this case A = ^cr 3 . 

We assume first particles with positive energy, i.e., e = +E{p). According to the definition (5.224) 
of (73 the two u vectors (1,0) T and (0, 1) T are eigenstates of |<t 3 with eigenvalues Therefore, 
the wave functions which are eigenstates of the helicity operator, are 



y(pe 3 ,+,+l\f,t) = N p 



¥(pe 3 ,+,-!|r,t) = N p 



y m + Ep 
( 



-v 

rn + Ev 



\ 



i(px 3 — E p t) 



i(px 3 — Ept) 



(10.311) 



where e 3 denotes the unit vector in the ^-direction and where 



E p = v^T^; M p = \^^. 

2m 



(10.312) 



We assume now particles with negative energy, i.e., e 
eigenfunctions of the helicity operator are in this case 



*(pe 3 ,-,+i|r,f) = Np 



m + En 



E(p). The wave functions which are 
\ 



where E p and N p are defined in (10.312) 



= K 



m + Ep 



J 



,i(px 3 + Ept) 



i(px 3 + Ept) 



(10.313) 
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To obtain free particle wave functions for arbitrary directions of p one can employ the wave functions 
(10.311, 10.313) except that the states (1, 0) T and (0, 1) T have to be replaced by eigenstates u±(p) 
of the spin operator along the direction of p. These eigenstates are obtained through a rotational 
transformation (5.222, 5.223) as follows 



u(p, + *) 



u(p, 



1 



exp 
exp 



where 



2 
i 
2 

e3 x p 
\P\ 



i9(p) • o 

3(p)-a 

Ah, V) 



(10.314) 
(10.315) 

(10.316) 



describes a rotation which aligns the X3-axis with the direction of p. [One can also express the 
rotation through Euler angles a, [3, 7 , in which case the transformation is given by (5.220).] The 
corresponding free particle wave functions are then 



V(p,+,+±\r,t) 

*(P,+,-||r,t) 
V(p,-,+l\r,t) 



1 ii? 
2 



r,t) 



= M p \ 

= K \ 
= M 
= K \ 



u(p, 
2—u(p, 



m + E, 



u(p, 



^E- v ^+\) 

u(p,+l) 
u(P,-|) 



J(p-r- E p t) 



i(p-r — Ept) 



J(p-r + Ept) 



J(p-r + Ept) 



(10.317) 
(10.318) 
(10.319) 
(10.320) 



where E p and Af are again given by (10.312). 



Generating Solutions Through Lorentz Transformation 

The solutions (10.311, 10.312) can be obtained also by means of the Lorentz transformation (10.262) 
for the bispinor wave function and the transformation (10.123). For this purpose one starts from 
the solutions of the Dirac equation in the chiral representation (10.226, 10.229), denoted by ~, for 
an r-independent wave function, i.e., a wave function which represents free particles at rest. The 
corresponding wave functions are determined through 



{ifdt - m) §(t) = 



(10.321) 



and are 



*(p = 0, +, \\t) 



1 

V2 



( 1 \ 



1 

V J 



-imt 
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*(p = o, +, -||t) 



i 

^2 



1 



V i / 



D —imt 



*(p = o,- 



1 

72 



*(P = 0, -, -||t) = 



i 



/ 1 \ 



-1 

V o / 

/ \ 

1 



V -i / 



Yimt 



D +imt 



(10.322) 



The reader can readily verify that transformation of these solutions to the Dirac representationsas 
defined in (10.228) yields the corresponding solutions (10.311, 10.313) in the p — > limit. This 
correspondence justifies the characterization ±, ±| of the wave functions stated in (10.322). 
The solutions (10.322) can be written in spinor form 



V2\Xo 



9 0! Xo e 



(10.323) 



Transformation (10.262) for a boost in the X3-direction, i.e., for w = (0,0, W3), yields for the 
exponential space-time dependence according to (10.174, 10.176) 



and for the bispinor part according to (10.262) 



Xo 



e2 



-M!3<T 3 











e 2 



ui3<T" 



Xo 



4> 



(10.324) 



(10.325) 



One should note that (j) , Xo are eigenstates of a 3 with eigenvalues ±1. Applying (10.324, 10.325) 
to (10.323) should yield the solutions for non-vanishing momentum p in the ^-direction. For the 
resulting wave functions in the chiral representation one can use then a notation corresponding to 
that adopted in (10.311) 



y{p{w 3 )e 3 , +, + §|r,t) 



V(p(w 3 )e 3 , +, -h\f,t) 



1 

72 
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71 



e 2 
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e i(px 3 - E p t) 



( e -\ w * 



e2 




1 




J(px 3 - Ept) 
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V(p(w 3 )e 3 , -, +l\r,t) = 



^(p(w 3 )e 3 , -, -\\r,t) = 



1 

71 



i 

71 



( e 3«>3 | 


' 1 ^ 




v j 


)) 




' 1 ^ 










v j 



3 i(pa; 3 + E p t) 



V 



e 2 





1 J I 



D i(px 3 + Ept) 



(10.326) 



where according to (10.61) p(w 3 ) = msinhu^. Transformation to the Dirac representation by 
means of (10.228) yields 



V(p(w 3 )e 3 , +, +l\r,t) 



cosh^ 



sinh^ 
\ 2 v J J 



1 > \ \ 


1 



J(px 3 - E p t) 



( 



V(p(w 3 )e 3 , +, -§|r,t) 



V(p(w 3 )e 3 , -, +l\r,t) 



cosh^f . 



N \ \ 



sinh^- 



sinh^ 



cosh^ 






1 

; / 



J(px 3 - Ept) 



J(px 3 + E p t) 



V(p(w 3 )e 3 , -, -z\r,t) = 
Employing the hyperbolic function properties 



sinh^ . ! 



cosh^f 





1 ) J 



,i(px 3 + Ept) 



cosh: 



coshx +1 x 

, Sinn— 

2 ' 2 



cosha; — 1 



(10.327) 



(10.328) 



the relationship (10.61) between the parameter w 3 and boost velocity v 3 , and the expression (10.311) 



for E p one obtains 
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w 3 jEp - m p 



and similarly 

sinh— = — = = (10.330) 

2 V 2m ^/2m (E p + m) 

Inserting expressions (10.329, 10.330) into (10.327), indeed, reproduces the positive energy wave 
functions (10.311) as well as the negative energy solutions (10.313) for —p. The change of sign for 
the latter solutions had to be expected as it was already noted for the negative energy solutions of 
the Klein-Gordon equation (10.168-10.176). 



Invariance of Dirac Equation Revisited 

At this point we like to provide a variation of the derivation of (10.243), the essential property 
stating the Lorentz-invariance of the Dirac equation. Actually, we will derive this equation only 
for infinitesimal transformations, which however, is sufficient since (1) it must hold then for any 
finite transformation, and since (2) the calculations following (10.243) considered solely the limit 
of infinitesimal transformations anyway. 

The reason why we provide another derivation of (10.243) is to familiarize ourselves with a formu- 
lation of Lorentz transformations of the bispinor wave finction ^f(x^) which treats the spinor and 
the space-time part of the wave function on the same footing. Such description will be essential for 
the formal description of Lorentz invariant wave equations for arbitray spin further below. 
In the new derivation we consider the particle described by the wave function transformed, but not 
the observer. This transformation, refered to as the active transformation, expresses the system in 
the old coordinates. The transformation is 



tf'(z") = S(L\) p(L\) V(a?) (10.331) 

where 5(L' ? g) denotes again the transformation acting on the bispinor character of the wave function 
\l/(x^) and where p^LP^) denotes the transformation acting on the space-time character of the 
wave function ty(x^). p(L v ^) has been defined in (10.123) above and characterized there. Such 
transformation had been applied by us, of course, when we generated the solutions ^>(p,X,A\x^) 
from the solutions describing particles at rest ^{p = 0, A, A|t). We expect, in general, that if ^f(x^) 
is a solution of the Dirac equation that ^f'(x^) as given in (10.331) is a solution as well. Making 
this expectation a postulate allows one to derive the condition (10.243) and, thereby, the proper 
transformation S(L V ^). 

To show this we rewrite the Dirac equation (10.221) using (10.331) 

(iS(L«t)'fS- 1 (L r >z)p(L'>t)d li p- 1 (L'>t) - m) *'(x") = (10.332) 

Here we have made use of the fact that S^IT^) commutes with and p^L^^) commutes with 7^. 
The fact that any such ^f'(x^) is a solution of the Dirac equation allows us to conclude 

S{L\)rS-\L\) p{L\)d,p-\L\) = Ydu (10.333) 

which is satisfied in case that the following conditions are met 

p{L\)d»p-\L\) = L\3 V - 
S(LV^S-\L\)L^ =Y- (10.334) 
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We will demonstrate now that the first condition is satisfied by p(L v ^). The second condition is 
identical to (10.243) and, of course, it is met by S(L V ^) as given in the chiral representation by 
(10.262). 

As mentioned already we will show condition (10.334) for infinitesimal Lorentz transformations 
L v £. We will proceed by employing the generators (10.128) to express p(L v ^) in its infinitesimal 
form and evaluate the expression 



+ ed • J + ew ■ K) ft ( 1 - ed ■ J - ew ■ K 



= ft, + eM\3 v + 0(e 2 ) 



(10.335) 



The result will show that the matrix M v ^ is identical to the generators of V ^ for the six choices i? = 
(1,0,0), w = (0,0,0), 3 = (0,1,0), w = (0,0,0), i? = (0,0,0), w = (0,0,1) . Inspection of 
(10.335) shows that we need to demonstrate 



We will proceed with this task considering all six cases: 
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(10.336) 



(10.337) 



(10.338) 



(10.339) 



(10.340) 



(10.341) 



(10.342) 



One can readily convince oneself that these results are consistent with (10.336). We have demon- 
strated, therefore, that any solution \l/(x^) transformed according to (10.331) is again a solution of 
the Dirac equation, i.e., the Dirac equation is invariant under active Lorentz transformations. 
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10.10 Dirac Particles in Electromagnetic Field 

We like to provide now a description for particles governed by the Dirac equation which includes 
the coupling to an electromagnetic field in the minimum coupling description. Following the re- 
spective procedure developed for the Klein-Gordon equation in Sect. 10.6 we assume that the field 
is described through the 4- vector potential and, accordingly, we replace in the Dirac equation 
the momentum operator p^ = id^ by id^ — qA^ where q is the charge of the respective particles 
(see Table 10.1 in Sect 10.6 above). Equivalently, we replace the operator by + iqA^. The 
Dirac equation (10.221) reads then 

[«7 M (3 M + iqAp) - m\ y(x v ) = (10.343) 

One may also include the electromagnetic field in the Dirac equation given in the Schrodinger form 
(10.233) by replacing idt by (see Table 10.1) idt — qV and p by 

# = p -qA. (10.344) 

The Dirac equation in the Schrodinger form reads then 

id t ^(x^) = (ds-# + qV + /3m) *(x' t ) (10.345) 

where a and $ are defined in (10.232). 

Non-Relativistic Limit 

We want to consider now the Dirac equation (10.345) in the so-called non-relativistic limit in which 
all energies are much smaller than m, e.g., for the scalar field V in (10.345) holds 

\qV\ « m . (10.346) 

For this purpose we choose the decomposition 

= ( 5l2 ) ■ ( io - 347 ) 



Using the notation a = (<7i, (72, 03 ) T one obtains then 

id t <j) = a -fix + QV 4> + m(j) (10.348) 

id t x = B ■ rf(j> + qV x ~ mx ■ (10.349) 

We want to focus on the stationary positive energy solution. This solution exhibits a time- 
dependence exp[— i(m + e)t] where for e holds in the non-relativistic limit |e| << m. Accordingly, 
we define 

tf,^) = e~ imt <S>{x») (10.350) 

x (x») = e- imt X{x») (10.351) 
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and assume that for the time-derivative of and X holds 







d t X 


<< m , 




X 



« m . 



Using (10.350, 10.351) in (10.348, 10.349) yields 

id t <S> = g-ttX + qV<$> 

id t X = <?•#<!> + qV X - 2mX . 

The properties (10.346, 10.352) allow one to approximate (10.354) 

« 2m X . 

and, accordingly, one can replace X in (10.353) by 



X 



a ■ 7r 
2m 



<l> 



(10.352) 



(10.353) 
(10.354) 



(10.355) 



(10.356) 



to obtain a closed equation for 



a ■ tt 



2m 



$ + qV$ 



(10.357) 



Equation (10.356), due to the mr x factor, identifies X as the small component of the bi-spinor 
wave function which, henceforth, does not need to be considered anymore. 

Equation (10.357) for <E> can be reformulated by expansion of [a • tt) 2 . For this purpose we employ 
the identity (5.230), derived in Sect. 5.7, which in the present case states 



(<t • tt) 2 = tt + i a ■ (tt x if ) . 
For the components of tt x tt holds 

^7fX7?^ = e jke (TTjTT k - TT k TTj) = e jk e[TTj, TT(\ 

We want to evaluate the latter commutator. One obtains 
[ttj, TT k ] = [^dj + qAj, ^d k + qA k ] 

= [-rdj, -^k] +q[Aj, jd k ] + q[-rdj, A k ] + q 2 [Aj, A k ] 



(10.358) 



(10.359) 



o 







= \[A 3 ,d k ] + ±[dj,A k ]. 
For an arbitrary function f[f) holds 

( [Aj, d k ] + [dj, A k ] )f = ( djA h - A k dj + Ajd k - d k Aj ) / 



(10.360) 



(10.361) 
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Using 

djA k f = {{d 3 A k ))f + A k djf 
d k Ajf = ((d k Aj))f + Ajdkf 

where {{dj ■ ■ •)) denotes confinement of the differential operator to within the brackets ((• • •)), one 
obtains 

([A 3 ,d k ] + [dj,A k })f = [{{d 3 A k )) - {{d k A 3 ))]f (10.362) 
or, using (10.360) and A^ = (V, -A), 

[Tr^TTfc] = %{{djA k - d k A 3 )) = -% (Vxl) e jke = -%B t e jhl (10.363) 

where we employed B(r,t) = V x A(r,t) [see (8.6)]. Equations (10.344, 10.358, 10.359, 10.363) 
allow us to write (10.357) in the final form 



id t <&(r, t) 



[p-ql{f,t)} 2 q _ g 

<r • B(r,t) + g l/(r,t) 



$(f,t) (10.364) 



2m 2m 
which is referred to as the Pauli equation. 

Comparision of (10.364) governing a two-dimensional wave function $ 6 C 2 with the corresponding 
non-relativistic Schrodinger equation (10.2) governing a one-dimensional wave function ip £ C, 
reveals a stunning feature: the Pauli equation does justice to its two-dimensional character; while 
agreeing in all other respects with the non-relativistic Schrodinger equation (10.2) it introduces the 
extra term qa ■ B & which describes the well-known interaction of a spin-^ particle with a magnetic 
field B. In other words, the spin-^ which emerged in the Lorentz-invariant theory as an algebraic 
necessity, does not leave the theory again when one takes the non-relativistic limit, but rather 
remains as a steady "guest" of non-relativistic physics with the proper interaction term. 
Let us consider briefly the consequences of the interaction of a spin-^ with the magnetic field. For 
this purpose we disregard the spatial degrees of freedom and assume the Schrodinger equation 

id t <f>(t) = qa-B<Z>{t) . (10.365) 

The formal solution of this equation is 

$(t) = e - iqtS ff <f>(0) . (10.366) 

Comparision of this expression with (5.222, 5.223) shows that the propagator in (10.366) can be 
interpreted as a rotation around the field B by an angle qtB, i.e., the interaction qa ■ B induces a 
precession of the spin-^ around the magnetic field. 



Dirac Particle in Coulomb Field - Spectrum 

We want to describe now the spectrum of a relativistic electron (q = — e) in the Coulomb field of 
a nucleus with charge Ze. The respective bispinor wave function ^f(x^) G C 4 is described as the 
stationary solution of the Dirac equation (10.343) for the vector potential 

Ze 2 

Afj, = ( , 0, 0, 0) . (10.367) 
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For the purpose of the solution we assume the chiral representation, i.e, we solve 

[i^idf, + iqA^) - m] *(a^) = (10.368) 

where ^(x^) and 7^ are defined in (10.228) and in (10.229), respectively. Employing tt^ as defined 
in Table 10.1 one can write (10.368) 

(f7r„ - m) *(o^) = 0. (10.369) 

For our solution we will adopt presently a strategy which follows closely that for the spectrum of 
pionic atoms in Sect. 10.6. For this purpose we 'square' the Dirac equation, multiplying (10.369) 
from the left by j v n v + m. This yields 

[iTidfi + iqA^) + m] [if^ + - m] §(x M ) 

= (T^T^TTi/ - m 2 )§(x M ) = 0. (10.370) 

Any solution of (10.368) is also a solution of (10.370), but the converse is not necessarily true. 
However, once a solution ^(rc M ) of (10.370) is obtained then 

[ifidp + iqA„) + m] (10.371) 

is a solution of (10.369). This follows from 

[if {d» + n\) + m] [if{d„ + iqAJ - m] 

= [ir(d„ + iqA^) - m] [zf(^ + ig^) + m\ (10.372) 

according to which follows from (10.370) 

[ifidp + iqA^) - m] [ifCfy + iqAJ + m] *(a^) = (10.373) 

such that we can conclude that (10.371), indeed, is a solution of (10.369). 

Equation (10.370) resembles closely the Klein-Gordon equation (10.180), but differs from it in an 
essential way. The difference arises from the term 7 M 7r M 7^77, in (10.370) for which holds 

3 _ 

r^r*v = E wfK + E • ( 10 - 374 ) 

The first term on the r.h.s. can be rewritten using, according to (10.230), (7 ) 2 = 1 and (t 5 ) 2 = 
-l,i = 1,2,3, 

E(f) 2 ^ = *o - **■ (10-375) 

Following the algebra that connected Eqs. (5.231), (5.232) in Sect. 5.7 one can write the second 
term in (10.374), noting from (10.230) 7^7^ = —7^7^, fj, 7^ v and altering 'dummy' summation 
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indices, 

E 7 M 7%^ = \ E ( 7^7%^ + l^T^u, , 



2 



fl,V=l 

= \ E ^ ^ fr» **] ( 10 - 376 ) 

|U,I/=1 
M 7^ 

This expression can be simplified due to the special form (10.367) of A^, i.e., due to A = 0. Since 

[tt m , tt^] =0 for/i,i/ = 1,2,3 (10.377) 
which follows readily from the definition (10.344), it holds 

\ E ^ 7"] **] 



4 

/i,i/=i 



3 1 3 



= 4 E fr°> ^ [*°> ^-i + 4 E 7 ] ^0] 

i=i 3=1 
= 5 ?'][*»>,*;]■ (10.378) 



j'=i 

According to the definition (10.229), the commutators [7 , 7 J ] are 

17 ' 7 J ^ 11 J \ a* J \<t3 OyV 10 

(7^ 
-a j 



(10.379) 



The commutators [ttq, itj] in (10.378) can be evaluated using (10.367) and the definition (10.344) 

[tto, TTj] = (-id t + qA , -id 3 ] = ~[{d t + iqA ) d 3 - d 3 (d t + iqA )]f 

= i((d jq A ))f (10.380) 

where / = f(f,t) is a suitable test function and where ((•••)) denotes the range to which the 
derivative is limited. Altogether, one can summarize (10.376-10.380) 

E = i ( ) ■ (( V ^o)) (10.381) 
According to (10.367) holds 

Ze 2 

VqA = r — . (10.382) 
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where r = r/\r\ is a unit vector. Combining this result with (10.381), (10.374), (10.375) the 
'squared' Dirac equation (10.368) reads 



t 



Ze' 



+ V 2 + i 



a • r 






-a ■ r 



Ze 2 



m 



^(x M ) = 



We seek stationary solutions of this equation. Such solutions are of the form 



(10.383) 



(10.384) 



e can be interpreted as the energy of the stationary state and, hence, it is this quantity that we 
want to determine. Insertion of (10.384) into (10.383) yields the purely spatial four-dimensional 
differential equation 



e + 



Ze 2 



2\ 2 



+ V 2 + i 



a ■ r 
—a ■ f 



Ze 2 



— m 



$(r) 



. 



(10.385) 



We split the wave function into two spin-^ components 

#(r) = 



Mr) 

Mr) 



(10.386) 



and obtain for the separate components <f>±(r) 



Z& 



2\ 2 



r*2 , ^ e 2 

+ V ± i a ■ r — — m 



Mr) = . 



(10.387) 



The expression (10.189) for the Laplacian and expansion of the term (• • -) 2 result in the two- 
dimensional equation 



L 2 - Z 2 e A ^ia-r Ze 2 2Ze 2 e 

H h e - m 2 



r 4>±{r) = 



(10.388) 



Except for the term ia ■ f this equation is identical to that posed by the one-dimensional Klein- 
Gordon equation for pionic atoms (10.191) solved in Sect. 10.6. In the latter case, a solution of 
the form ~ Y^ m {f) can be obtained. The term ia ■ f, however, is genuinely two-dimensional and, in 
fact, couples the orbital angular momentum of the electron to its spin-|. Accordingly, we express 
the solution of (10.388) in terms of states introduced in Sect. 6.5 which describe the coupling of 
orbital angular momentum and spin 



-j + i,...+j} 



1 3 

2 ' 2 



; m 



-J, 



(10.389) 



According to the results in Sect. 6.5 the operator ia -r is block-diagonal in this basis such that only 
the states for identical j, m values are coupled, i.e., only the two states {3^' m (J — |, f l r )> 3^m(i + 
|, ||f)} as given in (6.147, 6.148). We note that these states are also eigenstates of the angular 
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momentum operator L 2 [cf. (6.151]. We select, therefore, a specific pair of total spin-orbital angular 
momentum quantum numbers j, m and expand 



Using 



(10.390) 
(10.391) 



a-ry jm (j± i,i|f) = -y jm {jThk\r) 

derived in Sect. 6.5 [cf. (6.186)], property (6.151), which states that the states 3^ m (j ± |, ||f) are 
eigenfunctions of L 2 , together with the orthonormality of these two states leads to the coupled 
differential equation 



d 2 + 



2Ze 2 e 



+ e — m 



±i Ze 2 




(10.392) 



Z 2 e 



4 ±iZe 2 



h±(r) 
9±{r) 



We seek to bring (10.392) into diagonal form. Any similarity transformation leaves the first term in 
(10.392), involving the 2x2 unit matrix, unaltered. However, such transformation can be chosen 
as to diagonalize the second term. Since, in the present treatment, we want to determine solely the 
spectrum, not the wave functions, we require only the eigenvalues of the matrices 



±iZe 2 (j + DO' + f) - ZV 



(10.393) 



but do not explicitly consider further the wavefunctions. Obviously, the eigenvalues are independent 
of m. The two eigenvalues of both matrices are identical and can be written in the form 



where 



Ai(j)[Ai(j) + 1] and A 2 (j) [X 2 (j) + 1] 
AiO") = ^(j + |) 2 - Z 2 e± 



A 2 (j) 



i) 2 - Z 2 e 4 



Equation (10.392) reads then in the diagonal representation 



Ai, 2 (i)[Ai, 2 (i) + l) , 2eZe 2 



+ 



+ e z 



m 



/i,2(r) 







(10.394) 

(10.395) 
(10.396) 

(10.397) 



This equation is identical to the Klein-Gordon equation for pionic atoms written in the form 
(10.200), except for the slight difference in the expression of Ai i2 (i) as given by (10.395, 10.396) 
and (10.199), namely, the missing additive term — i, the values of the argument of Ai )2 (j) being 
j = 1, §, . . . rather than £ = 0, 1, ... as in the case of pionic atoms, and except for the fact that 
we have two sets of values for Ai j2 (j), namely, Ai(j) and A 2 (j).. We can, hence, conclude that the 
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spectrum of (10.397) is again given by eq. (10.203), albeit with some modifications. Using (10.395, 
10.396) we obtain, accordingly, 

; (10.398) 



1 + 



Z 2 e 4 



71% 

£ 2 = z2g4 ; (10-399) 



(n' + l+yO-+i)2-Z2 e 4)2 

1 + 

ri = 0,1,2,... , j =i,|,..., m = + 

where t\ corresponds to Ai(j') as given in (10.395) and €2 corresponds to X2U) as given in (10.396). 
For a given value of n! the energies e\ and €2 for identical 7-values correspond to mixtures of states 
with orbital angular momentum £ = j — \ and £ = 7 + |. The magnitude of the relativistic effect 
is determined by Z 2 e 4 . Expanding the energies in terms of this parameter allows one to identify 
the relationship between the energies e± and €2 and the non-relativistic spectrum. One obtains in 
case of (10.398, 10.399) 

£1 « m - H jf* + > ? + (10-400) 

£2 * m " 2 (n'Ti + |)2 + ( 10 - 401 ) 
n' = 0,1,2,... , 7 = 1,3,..., m = -7,-7 + 1,..., j . 

These expressions can be equated with the non-relativistic spectrum. Obviously, the second term on 
the r.h.s. of these equations describe the binding energy. In case of non-relativistic hydrogen-type 
atoms, including spin-|, the stationary states have binding energies 

mZV n = l,2,... t = 0,l,...,n-l 

E ~-^T> m = -e-i+l,...,l m s = n ■ (10 ' 402) 

In this expression n is the so-called main quantum number. It is given by n = n' + I + 1 where £ 
is the orbital angular momentum quantum number and n! = 0, 1, . . . counts the nodes of the wave 
function. One can equate (10.402) with (10.400) and (10.401) if one attributes to the respective 
states the angular momentum quantum numbers £ = j ; + | and t = j ' — \. One may also state 
this in the following way: (10.400) corresponds to a non-relativistic state with quantum numbers 
n, I and spin-orbital angular momentum 7 = (10.401) corresponds to a non-relativistic state 

with quantum numbers n, £ and spin-orbital angular momentum j = £ + \ . These considerations 
are summarized in the following equations 

TYl 

E D (n,e,j = £-i,m) = . ; (10.403) 

ll + / 2e4 

(n-<?+^/(<?+l) 2 -Z 2 e 4 ) 2 
771 

E B (n,£,j = £+i,m) = ; (10.404) 

h _i_ z e 

V (n-^-l + V^-^e 4 ) 2 

rc=l,2,...; £ = 0,1,..., n- 1 ; m = -7, -7 + 1, . . . , 7 
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Table 10.2: 

Binding energies for the hydrogen [Z = 1) atom. Degeneracies are denoted by f|\ The energies 
were evaluated with m = 511.0041 keV and e 2 = 1/137.036 by means of Eqs. (10.402, 10.405). 



One can combine the expressions (10.403, 10.404) finally into the single formula 
Er>(n,l,j,m) = n 1,2,... 

V ' ' J ' ' i = 0,1,. ..,71- 1 

m . fi for< = 



l\ + ^ 4 J ~ \ i ± i otherwise 

("-i-§ + VCj+§) a -^) a m = -j, -j + 

(10.405) 

In order to demonstrate relativistic effects in the spectrum of the hydrogen atom we compare 
in Table 10.2 the non-relativistic [cf. (10.402)] and the relativistic [cf. (10.405)] spectrum of the 
hydrogen atom. The table entries demonstrate that the energies as given by the expression (10.405) 
in terms of the non-relativistic quantum numbers n,t relate closely to the corresponding non- 
relativistic states, in fact, the non-relativistic and relativistic energies are hardly discernible. The 
reason is that the mean kinetic energy of the electron in the hydrogen atom, is in the range of 
10 eV, i.e., much less than the rest mass of the electron (511 keV). However, in case of heavier 
nuclei the kinetic energy of bound electrons in the ground state scales with the nuclear charge Z 
like Z 2 such that in case Z = 100 the kinetic energy is of the order of the rest mass and relativistic 
effects become important. This is clearly demonstrated by the comparision of non-relativistic and 
relativistic spectra of a hydrogen-type atom with Z = 100 in Table 10.3. 

Of particular interest is the effect of spin-orbit coupling which removes, for example, the non- 
relativistic degeneracy for the six 2p states of the hydrogen atom: in the present, i.e., relativistic, 
case these six states are split into energetically different 2pi and 2p3 states. The 2pi states with 

2 2 2 

j = \ involve two degenerate states corresponding to y\ m {\, \\f) for m = ±|, the 2^3 states with 
j = | involve four degenerate states corresponding to 3^3 m (l, ||f) for m = ±|, ±|. 
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Table 10.3: 

Binding energies for the hydrogen-type (Z = 100) atom. Degeneracies are denoted by f|\ The 
energies were evaluated with m = 511.0041 keV and e 2 = 1/137.036 by means of Eqs. (10.402, 
10.405). 



In order to investigate further the deviation between relativistic and non-relativistic spectra of 
hydrogen-type atoms we expand the expression (10.405) to order 0(Z 4 e s ). Introducing a = Z 2 e 4 
and p = j+ \ (10.405) reads 

1 (10.406) 
1 + ~~ / 

(n-(3+ V/3 2 -") 2 

The expansion (10.206) provides in the present case 



En(n,£,j,m) ~ m — 



mZ 2 e* 



m 



2n 3 



3 + 



3 

An 



+ 0{Z b e 



6„12\ 



(10.407) 



This expression allows one, for example, to estimate the difference between the energies of the 



3 1 



states 2ps and 2pi (cf. Tables 10.2,10.3). It holds for n = 2 and j = |, 2 



£(2pa) -E(2pi) 



v4 8 

mZ e 



mZ 4 e s 
32 



(10.408) 



Radial Dirac Equation 

We want to determine now the wave functions for the stationary states of a Dirac particle in a 
4-vector potential 

= (V(r), 0,0,0) (10.409) 

where V{r) is spherically symmetric. An example for such potential is the Coulomb potential 
V(r) = —Ze 2 /r considered further below. We assume for the wave function the stationary state 
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form 



—let 



<£(f) 

X(r) 



(10.410) 



where <3?(r), X{f) G C 2 describe the spatial and spin-| degrees of freedom, but are time- 
independent. The Dirac equation reads then, according to (10.232, 10.345), 



o-pX + + V(r) $ = e$ 

a-"p<S> - mX + V(r)X = eX 



(10.411) 
(10.412) 



In this equation a coupling between the wave functions <&(r) and X{r) arises due to the term a - p. 
This term has been discussed in detail in Sect. 6.5 [see, in particular, pp. 168]: the term is a scalar 
(rank zero tensor) in the space of the spin-angular momentum states yj m (j ± |, ||f) introduced in 
Sect. 6.5, i.e., the term is block-diagonal in the space spanned by the states yj m (j ' ± §, \\?) and does 
not couple states with different j, m-values; a -p has odd parity and it holds [c.f. (6.197, 6.198)] 



a -p g(r)y jm (j - §, ||f) 



= i 



d r + 



d r + 



i+1 



f{r)y jm {j-hW) 



g{r)y jrn {j + hh\r) 



(10.413) 



(10.414) 



These equations can be brought into a more symmetric form using 



d r + 



d r r 



which allows one to write (10.413, 10.414) 
a-prf(r)y jm (j + |,i|f) = 

a-prg(r)y jm (j - |, \\r) = 



d r + 



d r — 



3 + 



rf(r)y jm (j 



rg(r)y jm (j + |, \\r) 



(10.415) 



(10.416) 



The differential equations (10.411, 10.412) are four-dimensional with r-dependent wave functions. 
The arguments above allow one to eliminate the angular dependence by expanding <I>(r) and X(r) 
in terms of y jm (j + §, \\r) and y jm {j - §, ||f), i.e., 




i i f,) 

21 2 I' I 



\ 



(10.417) 
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In general, such expansion must include states with all possible j, m values. Presently, we consider 
the case that only states for one specific j,m pair contribute. Inserting (10.417) into (10.411, 
10.412), using (10.415, 10.416), the orthonormality property (6.157), and multiplying by r results 
in the following two independent pairs of coupled differential equations 



3 + 



8 r + 



3 + 



and 



d r + 



3 + 



d r — 



3 + 



d{r) + [ m + V(r) - e]a(r) = 

a(r) + \-m + V(r) - e]d{r) = 

c(r) + [ m + V(r) - e]b(r) = 

b(r) + [-m + V(r) - e] c(r) = . 



(10.418) 



(10.419) 



Obviously, only a(r),d(r) are coupled and 6(r),c(r) are coupled. Accordingly, there exist two 
independent solutions (10.417) of the form 



$(r) 

m 

$(f) 

m 



y ^jmu 2> 2 1' J y 

^ -^3; im (j + i,i|f) y 



(10.420) 



(10.421) 



where the factors i and —1 have been introduced for convenience. According to (10.418) holds for 
fi(r), 9iir) 



Or 



8 r + 



3 + 



J + -2 



and for / 2 (r), g 2 (r) 



d r + 



Or 



3 + 



3 + 



9l (r) + [e - m - V{r)]h{r) 

fi(r) - [e + m - V(r)]gi(r) 

9i{r) + [e - m - V(r)]f 2 (r) 

f2(r) - [e + m - V(r)]g 2 (r) 







(10.422) 



(10.423) 
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Equations (10.422) and (10.423) are identical, except for the opposite sign of the term (j + §); the 
equations determine, together with the appropriate boundary conditions at r = and r — > oo, 
the radial wave functions for Dirac particles in the potential (10.409). 



Dirac Particle in Coulomb Field - Wave Functions 

We want to determine now the wave functions of the stationary states of hydrogen-type atoms 
which correspond to the energy levels (10.405). We assume the 4- vector potential of pure Coulomb 
type (10.367) which is spherically symmetric such that equations (10.422, 10.423) apply for V(r) = 
—Ze 2 /r. Equation (10.422) determines solutions of the form (10.420). In the non-relativistic limit, 
<E> in (10.420) is the large component and X is the small component. Hence, (10.422) corresponds 
to states 

*(.r'<) « | '^^ y .nn(j + hh\rj | ; (10.424) 

i.e., to states with angular momentum £ = j + |. According to the discussion of the spectrum 
(10.405) of the relativistic hydrogen atom the corresponding states have quantum numbers n = 
1,2, . . . , I = 0, 1, ... , n — 1. Hence, (10.422) describes the states 2pi, 3pi, 3^3 , etc. Similarly, 

2 2 2 

(10.423), determining wave functions of the type (10.421), i.e., in the non-relativistic limit wave 
functions 

■Mr),, r j ir x 
i]> (.,./') v j * r Sjrn{3 2- -iin J . (1.0.125) 

covers states with angular momentum I = 2 ~ \ and, correspodingly the states 
lsi, 2s i, 2p3, 3s i, 3j»3, 3ds, etc. 

2 2 2 2 2 2 

We consider first the solution of (10.422). The solution of (10.422) follows in this case from the same 
procedure as that adopted for the radial wave function of the non-relativistic hydrogen-type atom. 
According to this procedure, one demonstrates first that the wave function at r — > behaves as r 7 
for some suitable 7, one demonstrates then that the wave functions for r — > 00 behaves as exp(— fir) 
for some suitable fi, and obtains finally a polynomial function p(r) such that r 7 exp(— fir)p(r) solves 
(10.422); enforcing the polynomial to be of finite order leads to discrete eigenvalues e, namely, the 
ones given in (10.405). 



Behaviour at r — > 



We consider first the behaviour of the solutions /i(r) and g\{r) of (10.422) near r 
that (10.422), for small r, can be written 



0. We note 



d r + 



3 + 



Ze 2 

9i(r) + — /i(r) 



fi(r) - —9i(r) 



(10.426) 
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Setting 
yields 



/i(r) ~ or 7 , gi (r) ~ br~* (10.427) 
o o 



7&r 7-i _ (^+1)5^-1 + ^ar^ 1 = 

7 ar 7 ~ 1 + (j' + ^ar 7 " 1 - Ze 2 br^~ 1 = 0. (10.428) 



or 



7 + (j + I) -Ze* 



This equation poses an eigenvalue problem (eigenvalue —7) for proper 7 values. One obtains 
7 = ±yj (j + ^) 2 — Z 2 e 4 . The assumed r-dependence in (10.427) makes only the positive solution 
possible. We have, hence, determined that the solutions /i(r) and g± (r), for small r, assume the 
r-dependence in (10.427) with 

7 = \/ti + \) 2 ~ Z^. (10.430) 

Note that the exponent in (10.427), in case j + \) 2 < Ze 2 , becomes imaginary. Such r-dependence 
would make the expectation value of the potential 

r 2 drp(r)^- (10.431) 

infinite since, according to (10.266, 10.267, 10.420), for the particle density holds then 



p(r) ~ jr 7 " 1 ] 2 = \ . (10.432) 



Behaviour at r — > 00 

For very large r values (10.422) becomes 

d r gi(r) = -(e - m,)/i(r) 

r /i(r) = (e + m)^i(r) (10.433) 
Iterating this equation once yields 

a r 2 5l (r) = ( m 2 - e 2 ) 5l (r) 

9 2 /i(r) = (m 2 -e 2 )/i(r) (10.434) 



The solutions of these equations are /i,<?i ~ exp(± yj m 2 — e 2 r) . Only the exponentially decaying 
solution is admissable and, hence, we conclude 

/i(r) ~ e^ r , 5l (r) ~ e"^ , /z = \/™ 2 - e 2 (10.435) 
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For bound states holds e < m and, hence, p is real. Let us consider then for the solution of (10.434) 

/i(r) = y /m + eae~' ir , gi(r) = -y/m-ea e^ r . (10.436) 
Insertion into (10.434) results in 

(m — e) y/m + e a — (m — e)\Jm + e a = 

(m + e) \Jm — e a — (rn + e) Vm — e a = (10.437) 
which is obviously correct. 

Solution of the Radial Dirac Equation for a Coulomb Potential 

To solve (10.422) for the Coulomb potential V{r) = —Ze 2 /r We assume a form for the solution 
which is adopted to the asymptotic solution (10.436). Accordingly, we set 

f^r) = Vm + ee^/iW (10.438) 

9l (r) = -y/m^ee'^ gi{r) (10.439) 

where fi is given in (10.435 ). Equation (10.422 ) leads to 

j + i Ze 2 ~ 

-yjm - e [d r — ] gi + \frn + e — /i + 

(m - e) y/m + e gi — {m- e) v/rn+~e /i = (10.440) 

Vm + e [d r H f\ + v 7 ^ - e 9i ~ 

r r 

(m + e)\fm - e/i + (m + e)\/m — egi = (10.441) 

The last two terms on the l.h.s. of both (10.440) and (10.441) correspond to (10.437) where they 

cancelled in case fi = gi = a. In the present case the functions f\ and g\ cannot be chosen 
identical due to the terms in the differential equations contributing for finite r. However, without 
loss of generality we can choose 

fi(r) = Mr) + Mr) , 9l(r) = Mr) ~ Mr) (10.442) 

which leads to a partial cancellation of the asymptotically dominant terms. We also introduce the 
new variable 

p = 2fir . (10.443) 

From this results after a little algebra 



[d p - J -^\ (0i - M - \f^^ — (01 + M + 02 = (10.444) 
p V rn — e p 



[d p + ^] (01 + 2 ) + J^- 1 — (01 - 02) - 02 = . (10.445) 
p V rn + e p 
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Addition and subtraction of these equations leads finally to the following two coupled differential 
equations for (f)\ and <p2 



eZe 2 



\fm 2 — e 2 p 



mZe 2 

ii - ; = = 02 = 



'\J~Tf 2 — c2 



'vnr — e z p 



d p (f>2 + 



3 + 



h + 



mZe 2 



V m 2 — e 2 p 



h + 



eZe 2 



V m 2 — e 2 p 



(10.446) 
(10.447) 



We seek solutions of (10.446 , 10.447) of the form 

n' 

Mp) = p j J2u s p 2 

s=0 
n' 

Mp) = p'Y.^p 2 



(10.448) 
(10.449) 



s=0 



for 7 given in (10.430) which conform to the proper r — > behaviour determined above [c.f. 
(10.426-10.430)]. Inserting (10.448, 10.449) into (10.446, 10.447) leads to 



E 



E 



(a + 7) a s + (j + \) S 
mZe 2 



eZe 2 



v 7 m 2 — c 



a. 



\Jm 2 — 



■08 



P 



s+7-1 







(10.450) 



/ \ n i ■ in mZe 2 
(s + 7)p2 + U + oJas H . a. 



\/w. 2 — e 2 



eZe 2 



From (10.450) follows 



From (10.451) follows 



m 2 — e 2 



mZe 2 



0s ~ 0,-1 



ti + l) 



= o 



0s s + 7 



eZe 2 



(10.451) 



(10.452) 



08-1 = (s + 7 + 



\/w, 2 — e 2 



A + 



" (J + 2") 2 



- + 7 - ^ ^ 



(s + 7 )2 + Z 2 e 4_ (i+ l )2 



Vm — e 



(10.453) 



Using (10.430) one can write this 



0s 



eZe 2 



s{s + 2 7 ) 



's-l 



(10.454) 
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Defining 



one obtains 



eZe 2 

so = AA^r " 7 (10.455) 
V fn — e 



5 — 5^ 

s(s + 27) 



(s-l-s )(s-s ) 

Ps-2 



(s-l)s(s-l + 2 7 )(s + 2 7 ) 

(1 - s Q )(2 - s )...(s- s ) 
s!(2 7 + l)(2 7 + 2)---(2 7 + s 



Po (10.456) 



From (10.452) follows 



= J + 5 "^? (-.o)(l-^)(2- 8o )-( a - a o) ^ (10 _ 457) 

So s!(2 7 + l)(2 7 + 2)---(2 7 + s) MU v 7 

One can relate the polynomials <j)\{p) and 4>2{p) denned through (10.448, 10.449) and (10.456, 
10.457) with the confluent hypergeometric functions 

„, , a a(a + 1) x 2 ,„ „ , . 

F a,c;x = 1 + -g + ( — + ... 10.458 
c c(c + l) 2! 

or, equivalently, with the associated Laguerre polynomials 

L(f) = F(-n,ct + l,x) . (10.459) 

It holds 

■ , 1 mZe 2 



MP) = 0o : 2 P 7 i ? (-^2 7 + l;p) (10.460) 

2 (p) = /3 p 7 i ? (l-s o ,2 7 + l; / 9) . (10.461) 



In order that the wave functions remain normalizable the power series (10.448, 10.449) must be of 
finite order. This requires that all coefficients a s and (5 S must vanish for s > n' for some n' £ N. 
The expressions (10.456) and (10.457) for (3 S and a s imply that s Q must then be an integer, i.e., 
s = n' . According to the definitions (10.430, 10.455) this confinement of s Q implies discrete values 
for e, namely, 



777 

e(n') = , ri = 0,1,2,... (10.462) 



1 + z2e4 



(n' + ^O'+D 2 -^) 2 

This expression agrees with the spectrum of relativistic hydrogen-type atoms derived above and 
given by (10.405). Comparision with (10.405) allows one to identify n' = n — j + \ which, in fact, 

is an integer. For example, for the states 2pi , 3pi , 3d-i holds n' = 1, 2, 1. We can, hence, conclude 

222 

that the polynomials in (10.461 for e values given by (10.405) and the ensuing s values ( 10.455) 
are finite. 
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Altogether we have determined the stationary states of the type (10.421) with radial wave functions 
fi(r), gi(r) determined by (10.438, 10.439), (10.442), and (10.460, 10.461). The coefficients (3 in 
(10.460, 10.461) are to be chosen to satisfy a normalization condition and to assign an overall 
phase. Due to the form (10.410) of the stationary state wave function the density p(x^) of the 
states under consideration, given by expression (10.267), is time- independent. The normalization 
integral is then 



poo rn /»2"7r 

/ r 2 dr / smOdO d<f> (|$(r)| 2 + \X{r)\ 2 ) = 
Jo Jo Jo 



(10.463) 



where and X, as in (10.421), are two-dimensional vectors determined through the explicit form 
of the spin-orbital angular momentum states yj m (j ± \, in (6.147, 6.148). The orthonormality 
properties (6.157, 6.158) of the latter states absorb the angular integral in (10.463) and yield [note 
the l/r factor in (10.421)] 

dr(\h(r)\ 2 + \ 9l (r)\ 2 ) = 1 (10.464) 



/ 

J o 



The evaluation of the integrals, which involve the confluent hyper geometric functions in (10.460, 
10.461), can follow the procedure adopted for the wave functions of the non-relativistic hydrogen 
atom and will not be carried out here. 

The wave functions (10.421) correspond to non-relativistic states with orbital angular momentum 
£ = j + \. They are described through quantum numbers n, j, i = j + ^,m. The complete wave 
function is given by the following set of formulas 



*(n,j,£ 



j + ^,m\ " 



-iet 



1 Mr) 
2' 2r > 



iF 1 (r)y j , m (j + 



Gi(r)y jtm ti-l,±\r) 



Fi(r) = F_(rc|r) , Gi(r) = F + (/c|r) , k = j + 



(10.465) 



(10.466) 



where 



F±( K \r) = ^N(2fir) 



7-l g -/ir 



(n' + 7)771 



— K 



F(-n',2 7 + l;2/jr) 



± n'F(l -n',2 7 + l;2/xr)| 



N 



r(2 7 + i) 



\ 



4777 



m =F e)r(2 7 + ri + 1) 

(n'+j)m j (n'+7)m 



«) 77'! 



(10.467) 



(10.468) 



and 



H = sjivn - e)(m + e) 

7 = y/U + h) 2 -^ 



n 



77 



3 ~ \ 
777 



(10.469) 



2 This formula has been adapted from "Relativistic Quantum Mechanics" by W. Greiner, (Springer, Berlin, 1990), 
Sect. 9.6. 
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We want to consider now the stationary states of the type (10.421) which, in the non-relativistic 
limit, become 

/ . Mr)^ / ■ i ii^ \ 
« e~ ut l — f ^-yjrn{3-2,2\r) (10.470) 

V / 

Obviously, this wavefunction has an orbital angular momentum quantum number I = 3 — \ and, 
accordingly, describes the complementary set of states lsi, 2s i, 2y>3, 3s i, 3p3, 3ds, etc. not not 

2 2 2 2 2 2 

covered by the wave functions given by (10.465-10.469). The radial wave functions fi(r) and g2(r) 
in (10.421) are governed by the radial Dirac equation (10.423) which differs from the radial Dirac 
equation for /i(r) and <7i(r) solely by the sign of the terms (j + \)/r. One can verify, tracing all 
steps which lead from (10.422) to (10.469) that the following wave functions result 

., / iF 2 (r)y j , m (j- h\\r) \ 
V(n,j,e = j- L m\x») = e- iet \ (10.471) 

2 V G 2 (r) yj , m (j + hm J 

F 2 {r) = F-(n\r) , G 2 (r) = F+( K \r) , n = -j-\ (10.472) 

where F±(n\r) are as given in (10.467-10.469). We have, hence, obtained closed expressions for the 
wave functions of all the stationary bound states of relativistic hydrogen-type atoms. 
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Chapter 11 



Spinor Formulation of Relativistic 
Quantum Mechanics 



11.1 The Lorentz Transformation of the Dirac Bispinor 

We will provide in the following a new formulation of the Dirac equation in the chiral representation 
defined through (10.225-10.229). Starting point is the Lorentz transformation S(w, #) for the 
bispinor wave function ^ in the chiral representation as given by (10.262). This transformation 
can be written 

- ( f & ) 

a(z) = exp(^z-a^\ (H-2) 

b(z) = exp^-^r-a) (11.3) 

z = w - iS . (11.4) 

We have altered here slightly our notation of S(w, $), expressing its dependence on w, $ through 
a complex variable z,z£ C 3 . 

Because of its block-diagonal form each of the diagonal components of S(z), i.e., a{z) and b(z), 
must be two-dimensional irreducible representations of the Lorentz group. This fact is remarkable 
since it implies that the representations provided through a(z) and b(z) are of lower dimension then 
the four-dimensional natural representation 1 L{w,dY v . The lower dimensionality of a(z) and b(z) 
implies, in a sense, that the corresponding representation of the Lorentz group is more basic than 
the natural representation and may serve as a building block for all representations, in particular, 
may be exploited to express the Lorentz-invariant equations of relativistic quantum mechanics. 
This is, indeed, what will be achieved in the following. 

We will proceed by building as much as possible on the results obtained sofar in the chiral repre- 
sentation of the Dirac equation. We will characterize the space on which the transformations a(z) 



1 We will see below that the representations a(z) and b(z) are, in fact, isomorphic to the natural representation, 
i.e., different L(w, "&Y u correspond to different a(z) and b(z). 
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and b(z) act, the so-called spinor space, will establish the map between L(w, i?)^ and a(z), b(z), 
express 4- vectors A^, A v , the operator and the Pauli matrices a in the new representation and, 
finally, formulate the Dirac equation, neutrino equation, and the Klein-Gordon equation in the 
spinor representation. 

A First Characterization of the Bispinor States 

We note that in case w = the Dirac transformations are pure rotations. In this case a(z) and 
b(z) are identical and read 

a(itf) = b(i$) = exp^—d-a^j , € M 3 . (11.5) 

The transformations in this case, i.e., for z = ifi^d £ M 3 , are elements of SU(2) and correspond, in 

(~) "* 

fact, to the rotational transformations of spin-| states as described by D £ usually expressed 

as product of rotations and of functions of Euler angles a, (3, 7 (see Chapter 5). For # = (0, (3, 0) T 
the transformations are 

= wu> = ( ) = ( ™:| ;::f ) . 

as given by (5.243). This characterization allows one to draw conclusions regarding the state space 
in which a(z) and b(z) operate, namely, a space of vectors (stated) ^ or wn i cn holds 

state 1 ~ \ _, = ^ j eR3 (1L?) 

state 2 ~ |±, — |) J 

where "~" stands for "transforms like". Here ||,±|) represents the familiar spin-^ states. 
Since a(z) acts on the first two components of the solution \& of the Dirac equation, and since b(z) 
acts on the third and fourth component of * one can characterize i& 



( *!(^) ~ \\,+\) \ 

* 2 (x") ~ \\-\) 

* 3 (*^) ~ \\,+\) 

V *4(^) ~ \\,-\) J 



i#, € M 3 . (11.8) 



We like to stress that there exists, however, a distinct difference in the transformation behaviour of 
*i(x^), ^2(2^) and ^(x^), ^(x^) in case z = u; + i$ for u; / 0. In this case holds a(z) / b(z) 
and *i(a;^), ^(a^ 4 ) transform according to a(z) whereas ^(x^), >3>4(x M ) transform according to 

6(5)- 

Relationship Between a(z) and b(z) 

The transformation 6(z) can be related to the conjugate complex of the transformation a(z), i.e., 
to 

a*(z) = exp(^z*-a*J (11.9) 
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where a* = (cr*, a%, a%). One can readily verify [c.f. (5.224)] 

(7i = a* , a 2 = -0*2 , 0-3 = 0-3 

From this one can derive 



b[z) = ea*{z)e- 1 



where 



1 
-1 



-1 

1 



(11.10) 
(11.11) 
(11.12) 



To prove (11.11) one first demonstrates that for e and e 1 as given in (11.12) does, in fact, hold 
ee' 1 = 1. One notices then, using ef(a)e~ 1 = /(eae -1 ), 



ea*(z)e 1 = exp 



z 



Explicit matrix multiplication using (5.224, 11.10, 11.12) yields 

-1 



eo\e = — a\ 
= -0-3 



£026 



-0-1 

3 . 



eae' 1 



-a 



-1 - ^* 
e ae = —a 



or in short 

Similarly, one can show 

a result to be used further below. Hence, one can express 

eo*(z)e _1 = exp ( — ^ z* ■ a^j = b(z) . 
We conclude, therefore, that the transformation (11.1) can be written in the form 

^ = ( ^ ea^lje- 1 



(11.13) 



(11.14) 

(11.15) 
(11.16) 

(11.17) 
(11.18) 



with a{z) given by (11.2, 11.4) and e,e 1 given by (11.12). This demonstrates that a(z) is the 
transformation which characterizes both components of S(z). 



Spatial Inversion 

One may question from the form of S(z) why the Dirac equation needs to be four-dimensional, 
featuring the components ^i(x M ), ^2(2^) as well as ^3(2;^), ^(x^) even though these pairs of 
components transform independently of each other. The answer lies in the necessity that application 
of spatial inversion should transform a solution of the Dirac equation into another possible solution 
of the Dirac equation. The effect of inversion on Lorentz transformations is, however, that they 
alter w into —w, but leave rotation angles 1? unaltered. 
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Let V denote the representation of spatial inversion in the space of the wave functions Obviously, 
V 2 = 1, i.e., V~ x = V. The transformation S(z) in the transformed space is then 



VS{w + i$)V = S{-w + i$) = ( b ^ ° 

\ a{z) 

i.e., the transformations a(z) and b(z) become interchanged. This implies 



(11.19) 



(11.20) 



Obviously, the space spanned by only two of the components of is not invariant under spatial 
inversion and, hence, does not suffice for particles like the electron which obey inversion symmetry. 
However, for particles like the neutrinos which do not obey inversion symmetry two components 
of the wave function are sufficient. In fact, the Lorentz invariant equation for neutrinos is only 
2-dimensional. 



11.2 Relationship Between the Lie Groups SL(2,C) and SO(3,l) 

We have pointed out that a(ii?),$ G M 3 , which describes pure rotations, is an element of SU(2). 
However, a(w + i-d) for w / is an element of 



SL(2, C) = { M, M is a complex 2 x 2-matrix, det(M) = 1 } . 
One can verify this by evaluating the determinant of a(z) 

det(a(z)) = det(e^") = e tr ^ s) = 1 

which follows from the fact that for any complex, non-singular matrix M holds 2 

det(e M ) = e tr ( M ) 

and from [c.f. (5.224)] 



tr( Oj 



0, j = 1,2,3. 



(11.21) 
(11.22) 

(11.23) 
(11.24) 



Exercise 11.2.1: Show that SL(2,C) defined in (11.21) together with matrix multiplication as 
the binary operation forms a group. 



2 The proof of this important property is straightforward in case of hermitian M (see Chapter 5). For the general 
case the proof, based on the Jordan-Chevalley theorem, can be found in G.G.A. Bauerle and E.A. de Kerf Lie 
Algebras, Part (Elsevier, Amsterdam, 1990), Exercise 1.10.3. 
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Mapping onto matrices M{A IX ) 

We want to establish now the relationship between SL(2,C) and the group C\ of proper, or- 
thochronous Lorentz transformations. Starting point is a bijective map between M 4 and the set of 
two-dimensional hermitian matrices defined through 

M(A") = op A* (11.25) 

where 



1 \ /OH / -s \ [1 
1 ) ' I 1 ) ' [ i ) ' I -1 



(11.26) 



V 00 01 02 03 / 

The quantity M thus defined does not transform like a covariant 4- vector. In fact, one wishes 
that the definition (11.25) of the matrix M{A IM ) is independent of the frame of reference, i.e., in a 
transformed frame should hold 

°pA» ^ . (11.27) 

Straightforward transformation into another frame of reference would replace by o'^. Using 
A^ = (L~ 1 )t J ' v A" / one would expect in a transformed frame to hold 

a^^a'^L- 1 )^^ . (11.28) 

Consistency of (11.28) and (11.27) requires then 

L„"c^ = o v (11.29) 

where we used (10.76). Noting that for covariant vectors according to (10.75) holds a! v = L^a^ 
one realizes that transforms inversely to covariant 4- vectors. We will prove below [cf. (11.135)] 
this transformation behaviour. 
M(A^) according to (11.25) can also be written 

& A l° - a» ) ■ < 1L30) 

Since the components of A^ are real, the matrix M(A^) is hermitian as can be seen from inspection 
of (11.26) or from the fact that the matrices <7o, 0i, 02, 03 are hermitian. The function M(A^) is 
bijective, in fact, one can provide a simple expression for the inverse of M(A tI ) 

M' = M(A») A* = ^(M'ap) . (11.31) 



Exercise 11.2.2: Show that 00,01,02,03 provide a linear-independent basis for the space of 
hermitian 2 x 2-matrices. Argue why M(A IJ ') = a^A^ provides a bijective map. Demonstrate that 
(11.31) holds. 



The following important property holds for M(A fl ) 

det ( M (A") ) = A^An (11.32) 

which follows directly from (11.30). 
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Transforming the matrices M(A fl ) 

We define now a transformation of the matrix M(A^) in the space of hermitian 2 x 2-matrices 

M M' = a Ma 1 " , a G 5L(2,C) . (11.33) 

This transformation conserves the hermitian property of M since 

(M')^ = (aMa)) ] = (a^iWV = a Ma) = M' (11.34) 

where we used the properties = M and (a^)^ = a. Due to det(a) = 1 the transformation 
(11.33) conserves the determinant of M. In fact, it holds for the matrix M' defined through (11.33) 

det(M') = det(oMa t ) = det(a) det(a t ) det(M) 

= [det(a)] 2 det(M) = det(M) . (11.35) 

We now apply the transformation (11.33) to M{A^) describing the action of the transformation in 
terms of transformations of M 1 . In fact, for any a £ SL(2, C) and for any there exists an A ,fM 
such that 

M(A'») = aM{A^)a) . (11.36) 

The suitable A^ can readily be constructed using (11.31). Accordingly, any a £ SL(2,C) defines 
the transformation [c.f. (11.31)] 

A^ A^ = ^tr [a M(A»)Ja^ . (11.37) 

Because of (11.32, 11.35) holds for this transformation 

A'^A'^ = A^Afj, (11.38) 

which implies that (11.37) defines actually a Lorentz transformation. The linear character of the 
transformation becomes apparent expressing A'^ as given in (11.37) using (11.25) 

A^ = -tr ( a a v a)a^ A v (11.39) 

which allows us to express finally 

A'" = L{aY v A v - L{ay v = ^(a^a 1 ^) . (11.40) 



Exercise 11.2.3: Show that L(a) fl u defined in (11.40) is an element of C 
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L(aY v provides a homomorphism 

We want to demonstrate now that the map between SL(2, C) and SO(3,l) defined through L{a)^ v 
[cf. (11.40)] respects the group property of SL(2, C) and of SO(3,l), i.e., 

L% = L{a^Y v L{a^Yp = L{ Yp (H-41) 

product product 
in SO(3,l) in SL(2,C) 

For this purpose one writes using tr(AB) = tv(BA) 

L(aiYvL(a 2 Yp = ^ ^tr ( aioya^ ) ^tr ( aia p a^a v ) 

V 

= y~" y ^tr ( q-^q^cr^ai) ^tr ( a 2 o>4ov ) ■ (11.42) 

V 

Defining 

T = a[a^ai , T' = a2<J p a\ (11.43) 
and using the definition of L^p in (11.41) results in 

L^p = ^ (^)a/3 r /3Q r 7<5( cr ^)«5 7 = \ Y A ^lS^ pa T 7 5 (11.44) 



is, a,. 



7,(5 7,5 

where 

A a p 7 § = ^2(vv) af3 (vv)sy • (11.45) 

V 

One can demonstrate through direct evaluation 



A 



a/3-yS 



' 2 


Q = 


= 


7 


= 5 = 1 




2 


Q = 


= 


7 


= 5 = 2 




< 2 


a = 


7 = 


1, 


(5 = 8 = 


2 


2 


a = 


7 = 


2, 


(3 = 5 = 


1 


, o 


else 











which yields 



l% = \ (r n r' n + r 22 r 22 + r 12 r' 21 + r 21 r' 12 ) = Jtr(rr') 



2 
1 

~~ 2 
1 

2 

This completes the proof of the homomorphic property of L{aYv 



(11.46) 



^tr (^a\a ll aia2(Tpa < 2 ^ = - tr ( a ^aia2(J p a\a\ ) 

^tr ^aia 2 cr p (aia2) t o- M ^ = L(aia 2 )% . (11.47) 
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Generators for SL(2, C) which correspond to K, J 

The transformations a € SL(2,C) as complex 2 x 2-matrices are denned through four complex 
or, correspondingly, eight real numbers. Because of the condition det(a) = 1 only six independent 
real numbers actually suffice for the definition of a. One expects then that six generators Gj and 
six real coordinates fj can be defined which allow one to represent a in the form 




exp > J f 3 G 3 • (11-48) 



We want to determine now the generators of the transformation a(z) as defined in (11.2) which 
correspond to the generators Ki, K 2 , K3, Ji, J 2 , J3 of the Lorentz transformations L^ v in the natural 
representations, i.e., correspond to the generators given by (10.47, 10.48). To this end we consider 
infinitesimal transformations and keep only terms of zero order and first order in the small variables. 
To obtain the generator of a(z) corresponding to the generator K±, denoted below as m, we write 
(11.36) 

M(L^ V A U ) = aM(A»)a) (11.49) 
assuming (note that g^ v is just the familiar Kronecker 5^ v ) 

L» v = g^u + e (K 1 )' i v (11.50) 
a = 1 + e«i . (11.51) 

Insertion of {K\)^ u as given in (10.48) yields for the l.h.s. of (11.49), noting the linearity of M(A M ), 

M(A» + e (Kif „ A") = M(A") + eM( (-A 1 , -A°,0,0) ) 

= M(A») - eaoA 1 - e^A (11.52) 

where we employed (11.25) in the last step. For the r.h.s. of (11.49) we obtain using (11.51) 

( 1 + e ki ) M(A^) ( 11 + e k{ ) 
= M{A») + e ( KiM(A^) + M(A")/e}) + 0(e 2 ) 

= M(A") + e(KiCT M + (7^1) A' 1 + 0(e 2 ) . (11.53) 
Equating (11.52) and (11.53) results in the condition 

aoA 1 - a x A° = ( Kl a„ + a„ k\ ) A" . (11.54) 
This reads for the four cases A^ = (1,0,0,0), A^ = (0, 1,0,0), A^ = (0,0,1,0), A^ = (0,0,0,1) 

— O"! = KidQ + 0"ok{ = Kl + k\ (11.55) 
(Jo = K\G\ + <7lfl{ (11.56) 

= ma 2 + <t 2 k\ (11.57) 
= K103 + ct 3 k{ . (11.58) 



One can verify readily that 



«i = --(Ti (11.59) 
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obeys these conditions. Similarly, one can show that the generators K2, K3 of a(z) corresponding to 
K2, K% and Ai, A2, A3 corresponding to J±, J2, J3 are given by 




(11.60) 



We can, hence, state that the following two transformations are equivalent 

L(w,$) = e^K + ^-J ) a(w-i#) = e"^'^ (11.61) 



G SO(3,l) G SL(2,C) 

acts on 4-vectors acts on spinors <p a G C 2 

(characterized below) 

This identifies the transformations a{z = w — I'd) as representations of Lorentz transformations, w 
describing boosts and $ describing rotations. 

Exercise 11.2.4: Show that the generators (11.60) of a G SL(2, C) correspond to the generators 
K and J of LV 



11.3 Spinors 

Definition of contravariant spinors 

We will now further characterize the states on which the transformation a(z) and its conjugate 
complex a*(z) act, the so-called contravariant spinors. We consider first the transformation a(z) 
which acts on a 2-dimensional space of states denoted by 

Q = ( ^2 ) e C 2 . (11.62) 

According to our earlier discussion holds 

(f) 1 transforms under rotations (z = ii)) like a spin-j state \ \, +\) 
4> 2 transforms under rotations (z = I'd) like a spin~| state \\, — \) . 

We denote the general (z = w + iti) transformation by 

4>' a = a a p(j)P d = a ^ 1 + a a 2 <p 2 , a = 1,2 (11.63) 
where we extended the summation convention of 4-vectors to spinors. Here 

•M - («» = ( % % ) (H-64) 

describes the matrix a(z). 
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Definition of a scalar product 

The question arises if for the states 4> a there exists a scalar product which is invariant under 
Lorentz transformations, i.e., invariant under transformations a(z). Such a scalar product does, 
indeed, exist and it plays a role for spinors which is as central as the role of the scalar product 
A^An is for 4-vectors. To arrive at a suitable scalar product we consider first only rotational 
transformations a(i-d). In this case spinors cf) a transform like spin-^ states and an invariant, which 
can be constructed from products 4> x x 2 •, etc., is the singlet state. In the notation developed in 
Chapter 5 holds for the singlet state 

|-,-;0,0) = V (0,0|-,m;-,-m)|-,m)i|-,-m) 2 (11.65) 

2 2 2 2 2 2 

m=±\ 

where | ■ ■ -)i describes the spin state of "particle 1" and | • • -) 2 describes the spin state of "particle 
2" and (0,0||,±|; |,=f|) stands for the Clebsch-Gordon coefficient. Using (0,0|^,±|; ^Tg) = 
±1/a/2 and equating the spin states of "particle 1" with the spinor (f) a , those of "particle 2" with 
the spinor x 13 one can state that the quantity 

S = ±= (^X 2 - </>V) (11.66) 

should constitute a singlet spin state, i.e., should remain invariant under transformations a(i-&). In 
fact, as we will demonstrate below such states are invariant under general Lorentz transformations 
a(z). 



Definition of covariant spinors 

Expression (11.66) is a bilinear form, invariant and as such has the necessary properties of a scalar 3 
product. However, this scalar product is anti-symmetric, i.e., exchange of <p a and x 13 alters the 
sign of the expression. The existence of a scalar product gives rise to the definition of a dual 
representation of the states 4> a denoted by <p a - The corresponding states are defined through 



It obviously holds 



4> x x 2 - 0V = 4> l xi + 4> 2 X2 (n.67) 



We will refer to 4> a ,x^, ... as contravariant spinors and to 4> a ,x/3, ... as covariant spinors. The 
relationship between the two can be expressed 



1 
-1 



(11.69) 
(11.70) 



3 'Scalar' implies invariance under rotations and is conventionally generalized to invariance under other symmetry 
trasnformations. 
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as can be verified from (11.68). The inverse of (11.69, 11.70) is 



^2 



- «- ( I I <»■«> 

/ -1 



c " - ( i o ' ■ (1L72) 



Exercise 11.3.1: Show that for any non-singular complex 2 x 2-matrix M holds 

eMe- 1 = det(M) (M _1 ) T 



The matrices e, e _1 connecting contravariant and covariant spinors play the role of the metric 
tensors gnu, 9^ of the Minkowski space [cf. (10.10, 10.74)]. Accordingly, we will express (11.69, 
11.70) and (11.71, 11.72) in a notation analogous to that chosen for contravariant and covariant 
4-vectors [cf. (10.72)] 

(11.73) 
(11.74) 



4>a = 




r = 






( en 


£af3 = 




V £21 






£ a/3 = 


( eU 




V e 21 



1 
-1 

e 12 \ / -1 
e 22 i " I 1 



(11.75) 
(11.76) 



The scalar product (11.67) will be expressed as 

= fxi + ^ 2 X2 = cp'x 2 ~ fx 1 • (H-77) 

For this scalar product holds 

rXa = -X a 4>« ■ (11-78) 
The transformation behaviour of (p a according to (11.63, 11.73, 11.75) is given by 

<p' a = e aP a^e^4>8 (11.79) 

as can be readily verified. 

Proof that 4> a Xa is Lorentz invariant 

We want to verify now that the scalar product (11.77) is Lorentz invariant. In the transformed 
frame holds 

<t>' a x' a = aVa 7 «V K /x K - (H-80) 



One can write in matrix notation 



(eae- 1 ) T a]^ . (11.81) 
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Using (11.2) and (11.14) one can write 

toe" 1 = ee^e" 1 = eh^^ 1 = e'^* (11.82) 

and with f(A) T = f(A T ) for polynomial f(A) 

(eae^f = e ~^*) T = e ~k g - 9 = a" 1 (11.83) 

Here we have employed the hermitian property of a, i.e., <j> = («t*) t = a. Insertion of (11.83) 
into (11.81) yields 



(eae- 1 ) T a]^ = [a^a]^ = b Kfi (11.84) 



and, hence, from (11.80) 

<P' a x' a = <Pxp • (H-85) 

The complex conjugate spinors 

We consider now the conjugate complex spinors 

m* = ( [fjy ) • (11-86) 
A concise notation of the conjugate complex spinors is provided by 

m* = r = ) (n.87) 

which we will employ from now on. Obviously, it holds (f> k = ((f) k )*, k = 1,2. The transformation 
behaviour of (ft a is 

<T = (a^)*/ (11.88) 

which one verifies taking the conjugate complex of (11.63). As discussed above, a*(z) provides 
a representation of the Lorentz group which is distinct from that provided by a(z). Hence, the 
conjugate complex spinors cf) a need to be considered separately from the spinors (f) a . We denote 

(off,)* = (11.89) 

such that (11.88) reads 

= a "p4>P (11.90) 

extending the summation convention to 'dotted' indices. 
We also define covariant versions of 4> a 

fa = (fa)* . (11.91) 
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The relationship between contravariant and covariant conjugate complex spinors can be expressed 
in analogy to (11.73, 11.76) 



' J o. 
id 



= e 



€ af3 



a/3 
c af5 



(11.92) 

(11.93) 
(11.94) 

(11.95) 



where e Q( g and e a/3 are the real matrices defined in (11.75, 11.75). For the spinors <p a and Xa thus 
defined holds that the scalar product 



0"xd = 4> l x\ + 4> 2 x 2 

is Lorentz invariant, a property which is rather evident. 
The transformation behaviour of <fia is 



(11.96) 



e d/3 a/3 7 e7 ^5 



(11.97) 



The transformation, in matrix notation, is governed by the operator ea*(z)e~ l which arises in 
the Lorentz transformation (11.18) of the bispinor wave function ^f, ea*(z) accounting for the 
transformation behaviour of the third and fourth spinor component of ^f. A comparision of (11.18) 
and (11.97) implies then that transforms like ^3, ^4, i.e., one can state 

4>i transforms under rotations (z = id) like a spin-^ state ||, +|) 
4>2 transforms under rotations (z = I'd) like a spin-^ state \ \,—\) ■ 

The transformation behaviour of ^ (note that we do not include presently the space-time depen- 
dence of the wave function) 



(11.98) 



obviously implies that the solution of the Dirac equation in the chiral representation can be written 
in spinor form 









^ 2 (x^) 




2 (x^) 


^ 3 (* M ) 






V *4^) / 







(11.99) 



11.4 Spinor Tensors 

We generalize now our definition of spinors 4> a to tensors. A tensor 



(11.100) 
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is a quantity which under Lorentz transformations behaves as 

k t 

i /aia 2 -a fc /3 1 /j 2 -ft = "Q Q a m ^ "Q a /3„.^ pvTkSv-St _ (11.101) 



m=l n=l 



An example is the tensor i"' 3 which will play an important role in the spinor presentation of the 
Dirac equation. This tensor transforms according to 

t' a P = aVV 7 * (11.102) 
This reads in matrix notation, using conventional matrix indices j, k, £, m, 

t'jk = ( atat ) jfc = a ^4m^m ■ (11.103) 

Similarly, the transformation bevaviour of a tensor reads in spinor and matrix notation 

t'"P = a^a^gf 6 , t' jk = {ata T ) jk = £ a je a km t £m (11.104) 

£,m 

Indices on tensors can also be lowered employing the formula 

tj = e ai t^ (11.105) 

and generalizations thereof. 

An example of a tensor is e Q/3 and e a p. This tensor is actually invariant under Lorentz transforma- 
tions, i.e., it holds 

e' a( * = e al3 , e' a/3 = e a p (11.106) 



Exercise 11.4.1: Prove equation (11.106). 



The 4 vector M 1 in spinor form 

We want to provide now the spinor form of the 4- vector A 1 *, i.e., we want to express through a 
spinor tensor. This task implies that we seek a tensor, the elements of which are linear functions 
of A^. An obvious candidate is [cf. (11.25] M(A^) = o^A^. We had demonstrated that M(A^) 
transforms according to 

M' = M(L fl u A u ) = aM(A»)a< (11.107) 
which reads in spinor notation [cf. (11.102, 11.103) 

A' a P = a a y^ h . (11.108) 

Obviously, this transformation behaviour is in harmony with the tensor notation adopted, i.e., with 
contravariant indices a$. According to (11.25) the tensor is explicitly 

p ( A 11 A 1 * \ ( A + A 3 A 1 - iA 2 
A ~ ^A 2i A 22 J ~ {A 1 + iA 2 A - A 3 



(11.109) 
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One can express A a/3 also through 

A<* = ( A \~ A \ ~ Al V A2 ) • (11-110) 
V -A x - iA 2 A + A 3 J v ' 

The 4- vectors A^ 1 , A^ can also be associated with tensors 

A a$ = e^A* . (11.111) 

This tensor reads in matrix notation 

A ±i A 12 \ ( 1 \ ( A 11 A 1 * \ ( -1 



(11.112) 



A 2i A 22 J V -1 J \ A 21 A 22 I \ 1 



A 22 -A 2i 
-A 12 A 11 



Hence, employing (11.109, 11.110) one obtains 



A - A 3 -A 1 - iA 2 
A a$ = [ _ A i + iA 2 A o + A 3 ) (H-113) 



Ks = ( f° + ± A l + lA l ) • (n-114) 



af3 \ Ai - iA 2 A - A 3 
We finally note that the 4-vector scalar product A^B^ reads in spinor notation 

A»B, = \A^B a$ . (11.115) 



Exercise 11.4.2: Prove that (11.115) is correct. 



<9 M in spinor notation 

The relationship between 4-vectors A^ 1 , A^ and tensors t a/3 can be applied to the partial differential 
operator d^. Using (11.110) one can state 

ff* = ( "V/?) • (11-116) 

\ -di - id 2 d + d 3 J 



Similarly, (11.114) yields 
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in Tensor Notation 

We want to develop now the tensor notation for (11.26) and its contravariant analogue a 11 

1 W 1 W -i W 1 

oiy'Vioy'Vi oy'Vo-i 

For this purpose we consider first the transformation behaviour of cr M and o^. We will obtain the 
transformation behaviour of a^, a a building on the known transformation behaviour of 7^. This is 
possible since 7^ can be expressed through a^^a^. Comparision of (10.229) and (11.118) yields 

r = C ° ) . (11.119) 



Using (11.15) one can write 
and, hence, 



afl = e K)*e- 1 (11.120) 

One expects then that the transformation properties of should follow from the transformation 
properties established already for 7^ [c.f. (10.243)]. Note that (10.243) holds independently of the 
representation chosen, i.e., holds also in the chiral representation. 

To obtain the transformation properties of we employ then (10.243) in the chiral representation 
expressing S(L\) by (11.18) and 7^ by (11.121). Equation (10.243) reads then 

a \ / 0* \ ( a' 1 



(11.122) 



e^e- 1 J \ e(^)*e- 1 J \ e^e" 1 ) L * 



a v 
e(a u )*e~ l 



The l.h.s. of this equation is 



ea*(^)*e- 1 a- 1 ' L " IU - 



and, hence, one can conclude 



a^e-Vr 1 ^^ = ^ (11.124) 

ea*K)*e- 1 a- 1 L% = ef/j'r 1 . (11.125) 

Equation (11.125) is equivalent to 

a *{a tx ye- 1 a- 1 eV / lx = (a u )* (11.126) 



which is the complex conjugate of (11.124), i.e., (11.125) is equivalent to (11.124). Hence, (11.124) 
constitutes the essential transformation property of <7 M and will be considered further. 
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One can rewrite (11.124) using (11.16, 11.2) 



e2' 



Exploiting the hermitian property of a, e.g., (<?*) = a yields using (11.2) 



One can express, therefore, equation (11.124) 

aff" [a*] T L\ = a v . 



(11.127) 



(11.128) 



(11.129) 



We want to demonstrate now that the expression aa fl [a*] T is to be interpreted as the transform 
of (J^ under Lorentz transformations. In fact, under rotations the Pauli matrices transform like 

a = i,2,3) 



a{i"&) <jj ( a(ii?) 



a(i$)aj a*(i#) 



(11.130) 



We argue in analogy to the logic applied in going from (11.107) to (11.108) that the same trans- 
formation behaviour applies then for general Lorentz transformations, i.e., transformations (11.2, 
11.4) with w / 0. One can, hence, state that <t m in a new reference frame is 



(11.131) 



where a is given by (11.2, 11.4). This transformation behaviour, according to (11.102, 11.103) 
identifies <t m as a tensor of type t a/3 . It holds according to (11.118) 

I ((T/1) 2i ((jM) 22 ] 



1 
1 



-1 
-1 

"V 



i 
-i 



-1 
1 



(11.132) 



fj, = 2 



/* = 3 



and 



M 11 M 1 * 
K) 2 i K) 22 



1 o 
o 1 



1 \ / -i 

1 J ' V i 
y v ' v ' ^ 



1 
-1 



(11.133) 



n = i 



li = 2 



H = 3 



) 



Combining (11.129, 11.131) one can express the transformation behaviour of in the succinct 
form 

L%a'» = a v . (11.134) 
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Inverting contravariant and covariant indices one can also state 

L u ^ a' = a v . 



(11.135) 



This is the property surmised already above [cf. (11.29)]. We can summarize that and 
transform like a 4-vector, however, the transformation is inverse to that of ordinary 4-vectors. 
Each of the 4x4 =16 matrix elements in (11.132) and (11.133) is characterized through a 4-vector 
index fi, /j, = 0, 1,2,3 as well as through two spinor indices a(3. We want to express now and 
o" M also with respect to the 4-vector index /i in spinor form employing (11.114). This yields 



'a/9 



CO + C3 
<7l — i(J2 



<T\ + 102 



= 2 



V 



\ 



(11.136) 



where on the rhs. the submatrices correspond to t a/3 spinors. We can, in fact, state 



/ 



V 



( CT ii 




(^li 




' ( CT 12 


11 


( ff 12 


12 


(^li 


,2i 


(^li 


22 J 


V ( a 12 


2i 


(^12 


22 


( a 2i 


)" 






' (°22 


11 


(°22 


12 


( CT 2i 


,2i 


( a 2i 


22 J 


V ( a 22 


2i 


(°22 


22 



(11.137) 



Equating this with the r.h.s. of (11.136) results in the succinct expression 

7<5 



a/3 



(11.138) 



Note that all elements of a a/ j are real and that there are only four non- vanishing elements. 

In (11.138) the 'inner' covariant spinor indices, i.e., a, (3, account for the 4-vector index /i, whereas 
the 'outer' contravariant spinor indices, i.e., 7,5, account for the elements of the individual Pauli 
matrices. We will now consider the representation of a^, in which the contravariant indices are 
moved 'inside', i.e., account for the 4-vector /U, and the covariant indices are moved outside. The 
desired change of representation^^)"' 3 — ► ( a ^) a $ corresponds to a transformation of the basis of 
spin states 

and, hence, corresponds to the transformation 



(11.139) 
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(11.140) 



where we employed the expressions (11.12) for e and e 1 . Using (11.110) to express a al3 in terms 
of a M yields together with (5.224) 
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and, employing then transformation (11.140) to transform each of the four submatrices, which in 
(11.137) are in a basis (• • -) a/3 to a basis (• • •) a results in 



7<5 



This can be expressed 
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Combined with (11.138) one can conclude that the following property holds 



(0 2 2 J J 



(11.142) 



(11.143) 



(11.144) 



The Dirac Matrices 7 M in spinor notation 

We want to express now the Dirac matrices 7^ in spinor form. For this purpose we start from the 
expression (11.121) of 7A This expression implies that the element of 7^ given by ecr^e" 1 is in the 
basis \ a p whereas the element of 7^ given by cr M is in the basis | Q/3 . Accordingly, we write 
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(« 



(a» 

at? c 



Let be a covariant 4-vector. One can write then the scalar product using (11.115) 
Exploiting the property (11.144) results in the simple relationship 



A a @ 



(11.145) 



(11.146) 



(11.147) 
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11.5 Lorentz Invariant Field Equations in Spinor Form 

Dirac Equation 

(11.147) allows us to rewrite the Dirac equation in the chiral representation (10.226) 

i-fd^ix*) = ( .!? J = ra$(/). (11.148) 

Employing ^(x^) in the form (11.99) yields the Dirac equation in spinor form 

iff^Xp = m <P a (11.149) 

id^r = m X$ . (11.150) 

The simplicity of this equation is striking. 



Chapter 12 



Symmetries in Physics: Isospin and 
the Eightfold Way 

by Melih Sener and Klaus Schulten 

Symmetries and their consequences are central to physics. In this chapter we will discuss a particular 
set of symmetries that have played a seminal role in the development of elementary particle and 
nuclear physics. These are the isospin symmetry of nuclear interactions and its natural extension, 
the so-called Eightfold Way. 

The organization of this chapter is as follows: In the next section we will discuss the relation 
between symmetries of a quantum mechanical system and the degeneracies between its energy 
levels. We will particularly use the example of spherically symmetric potentials. In the following 
section we will introduce the concept of isospin as an approximate SU (2) symmetry, which identifies 
the proton and the neutron as different states of the same particle. We will also introduce the quark 
model as a natural framework to represent the observed symmetries. We will apply these concepts 
to an analysis of nucleon-nucleon and nucleon-meson scattering. In the final section, we will discuss 
the SU{3) symmetry of three quark flavors. The algebraic structure and the representations of 
SU(3) will be discussed in parallel to SU(2) and particle families will be identified in terms of 
representations of the underlying symmetry group. 



12.1 Symmetry and Degeneracies 



The degeneracies of energy levels of a quantum mechanical system are related to its symmetries. 
Let us assume a continuous symmetry obeyed by a quantum mechanical system. The action of the 
symmetry operations on quantum mechanical states are given by elements of a corresponding Lie 
group, i.e., 

= exp^> fel Sfcj • (12.1) 

Then the generators, S k , will commute with the Hamiltonian of the system, 

[H,S k } = 0. (12.2) 
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The action of any symmetry generator, S k , on an energy eigenstate, ?pE,\i,...,\ n , leaves the energy 
of the state invariant 



Hex.p(ia k S k ) ipEM,...,x n = Eexp(ia k S k ) ip E ,\ 1: ...,\ n - (12.3) 

If the newly obtained state is linearly independent of the original one, this implies a degeneracy in 
the spectrum. We will investigate this shortly in more detail in the case of systems with spherical 
symmetry, where the symmetry generators, S k , can be identified with the angular momentum 
operators, J k , as studied in chapter 7. 

Lie groups play an essential role in the discussion of mass degeneracies in particle physics. In order 
to illustrate this, we first consider a particular example of the implications of symmetry, namely 
motion in a spherically symmetric potential described by the group 50(3) (or its double covering 
SU{2) as discussed in section 5.12). 

In chapter 7 the dynamics of a particle moving in three dimensions under the influence of a spher- 
ically symmetric potential, V(r), has been discussed. The spherical symmetry implies the commu- 
tation of the Hamiltonian with angular momentum operators (7.8) 

[H, J k ] = 0, k = 1,2,3. (12.4) 

The stationary Schrodinger equation, (7.18), can then be reduced to a one-dimensional radial 
equation, (7.24), which yields a set of eigenstates of the form 

— VE,£,m 

(r)Y tm (0,<i>), (12.5) 

with m = —I, and the corresponding energy levels are independent of m. Therefore, each 

energy level is (21 + l)-fold degenerate. This degeneracy follows from the fact that any rotation, 
as represented by an element of 50(3), (7.39), generates a state which has the same energy as the 
original one. 



Presence of additional symmetries may further increase the degeneracy of the system. As an 
example of this we will consider the Coulomb problem with the Hamiltonian 

H = f- - -. (12.6) 

From elementary quantum mechanics we know the spectrum of the hydrogen atom. The energy 
levels are 

„ fnk 2 

where the orbital angular momentum, I, is allowed to take values in 0, . . . , n — 1. The energy levels 
are totally independent of I. For example, the states 3s, 3p and 3d all have the same energy. We 
want to understand this extra degeneracy in terms of the extra symmetry of the hydrogen atom 
given by an additional set of symmetry generators introduced below. 

Classically, the additional symmetry generators of the Coulomb problem are the three components 
of the so-called eccentricity vector discovered by Hamilton 
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The vector e points along the symmetry axis of the elliptical orbit and its length equals the eccen- 
tricity of the orbit. The vector in (12.8) is not a hermitian operator. The corresponding quantum 
mechanical hermitian operator can be defined by 



ei = 


1 

2m 


{P2J3 


-P3J2 + hP2 - 


- J2P3) 


ei = 


1 

2m 


(J3P2 


- -hpz + ihpi) 


- k—, 
r 


£2 = 


1 

2m 


(JlP3 


- J3P1 + itm) 


r 


£3 = 


1 

2m 


(J2PI 


- J\P2 + ihpz) 


u X3 
- k—, 
r 



Xl 

J 

r 



(12.9) 

(12.10) 

(12.11) 

(12.12) 
(12.13) 

It can be verified explicitly that its components commute with the Hamiltonian. 
In order to understand the aforementioned extra degeneracy, we will compute the hydrogen spec- 
trum using the additional symmetry. For this purpose we first note that 

J-e = 0. (12.14) 

This follows from a • (a x b) = (a x b) ■ a = 0, which is valid even when a and b do not commute. 
We will also need the following identity [4] 



r 2 = — 

m 



(F + h^+k 2 , (12.15) 



which can be proved after very considerable algebra. 

In the following we consider the bound states, which have a negative energy E. Therefore, in the 
subspace of the Hilbert space corresponding to a certain energy we can replace H by E. Now we 
scale the eccentricity vector as follows 



tf=VS!?. (12.16) 
Through some algebra [4] the following commutation relations can be verified 

[Ki,Jj] = ihe i]k K k , (12.17) 
[Ki,Kj] = ihe ijk J k , (12.18) 

which complement the familiar angular momentum algebra of section 5.3. 
We introduce the following new operators 

A=^(J + K), (12.19) 
B=\{J-K), (12.20) 



which can be shown to satisfy 



[Ai,Aj] = ihe ijk A k , (12.21) 
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h, (12.22) 

(12.23) 

(12.24) 
(12.25) 

So far we have shown that the symmetry generators form an algebra, which is identical to the the 
direct sum of the Lie algebra of of two SO (3) (or SU{2)) algebras. By comparing to the rotation 
algebra introduced in chapter 5, we can read off the eigenvalues of A 2 and B 2 from (12.21) and 
(12.22): 



Bi,Bj 


= ihe 




= o, 


A, H 


= o, 


B,H 


= 0. 



A 2 



B 



1 



a(a + l)h , a = 0, -, 1, 
b(b + l)h 2 ,6 = 0,^,1,. 



Following (12.14) we note that 



A 2 - B 2 = J - e = 0. 

This implies that a = b. In order to arrive at the spectrum a final bit algebra is needed 

A 2 + B 2 = J 2 + K 2 



2E 



mk 
~\E 



(12.26) 
(12.27) 

(12.28) 

(12.29) 
(12.30) 

(12.31) 



where we have used (12.15). Using this equation the energy eigenvalues can be written in terms 
of the eigenvalues of A 2 and B 2 operators. Noticing that A 2 and B 2 have the same eigenvalues 
because of (12.28), the energy eigenvalues are found to be 



E = — 



mk 



2h 2 {2a + l) 2 ' 



o,^,i,... 



(12.32) 



A comparison with (12.7) tells us that (2a + 1) = n. Furthermore the bound on the orbital angular 
momentum, I, can be seen to follow from the triangle inequality as applied to J = A + B, namely 
that 

(12.33) 
(12.34) 

It follows that I has to have values in {0 = \a — b\, 1, . . . , a + b = n — 1}. This illustrates the effect 
of additional symmetries to the degeneracy structure of a quantum mechanical system. 
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> 


A 




B 


= 




J 


< 


A 


+ 


B 


= 2 


A 



In contrast to the discussion above about extra symmetries, a lack of symmetry implies a lack of 
degeneracy in the energy levels of a quantum mechanical system. The most extreme case of this is 
the quantum analogue of a classically chaotic system. Chaos is described classically as exponential 
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sensitivity to initial conditions, in the sense that nearby trajectories in the phase space diverge from 
each other over time. However, another manifestation of chaos is the lack of independent operators 
commuting with the Hamiltonian. 

A typical example of this so called quantum chaos is the quantum billiard problem, which is a 
particle in box problem in two dimensions with a boundary which can be chosen arbitrarily. If the 
chosen boundary is 'irregular' in a suitably defined sense, the classical trajectories will diverge from 
each other after successive bounces from the boundary. For a more detailed discussion of quantum 
chaos in billiard systems we refer the reader to [7] and the references therein. 
In the case of billiards and other examples of quantum chaos one common observation is the almost 
nonexistence of degeneracies and the fact that the energy levels are more evenly spaced. This is 
known as level repulsion. 

In the next section we will proceed with the discussion of a symmetry, which was discovered by 
observing degeneracies in the particle spectrum. 

12.2 Isospin and the 577(2) flavor symmetry 

The concept of isospin goes back to Heisenberg, who, after the discovery of the neutron in 1932, 
suggested that the proton and the neutron can be regarded as two states of a single particle. This 
was motivated by the observation that their masses are approximately equal: m p = 938.28MeV/c 2 , 
m p = 939.57Mel//c 2 . Following the mass-energy equivalence of special relativity 



this mass equivalence can be viewed as an energy degeneracy of the underlying interactions. 
This (approximate) degeneracy led into the idea of the existence of an (approximate) symmetry 
obeyed by the underlying nuclear interactions, namely, that the proton and the neutron behave 
identically under the so-called strong interactions and that their difference is solely in their charge 
content. (Strong interactions bind the atomic nucleus together.) 

If the proton and the neutron are to be viewed as two linearly independent states of the same 
particle, it is natural to represent them in terms of a two component vector, analogous to the 
spin-up and spin-down states of a spin- \ system 



In analogy to the concept of spin regarding the rotations in 3-space as discussed in chapter 5, the 
isospin symmetry is also governed by an SU{2) group 'rotating' components in (12.36) into each 
other in abstract isospin space. This enables us to utilize what we already know about the SU{2) 
symmetry group from the study of angular momentum. For example, we will be able to use the 
familiar Clebsch-Gordan coefficients to combine the isospin of two particles the same way we added 
spin in chapter 6. 

It is important to place the isospin concept in its proper historical context. Originally it was 
believed that isospin was an exact symmetry of strong interactions and that it was violated by 
electromagnetic and weak interactions. (Weak interactions are responsible, for example, for the 



E = mc 2 , 



(12.35) 




(12.36) 
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beta decay). The mass difference between the neutron and the proton could then be attributed to 
the charge content of the latter. If the mass difference (or the energy difference) were to be to be 
purely electrostatic in nature, the proton had to be heavier. However, the proton is the lighter of 
the two. If it were otherwise the proton would be unstable by decaying into the neutron, spelling 
disaster for the stability of matter. 

Isospin symmetry is not an exact symmetry of strong interactions, albeit it is a good approximate 
one. Therefore it remains a useful concept. Further than that, as we shall see below, it can be seen 
as part of a larger (and more approximate) symmetry which is of great utility to classify observed 
particle families. 



We can describe isospin multiplets the same way we have described the angular momentum and spin 
multiplets. Denoting the total isospin, /, and its third component, ^3, as good quantum numbers, 
we can re- write (12.36) as a multiplet with I = \ 



P 



n 



(12.37) 



As an example of a multiplet with / = 1 we have the three pions or 7r-mesons 

7T+ = |1,1) , 7T° = 1 1, 0> , TT~ = 1 1, — 1> , (12.38) 

which have all nearly identical masses. (m^± = 139.6MeV/c 2 , m^u = 135.0MeV/c 2 ). Shortly we 
will see how to describe both the pion and nucleon states as composites of more fundamental I = \ 
states. 

In the framework of the quark of model, the fundamental representation of the isospin symmetry 
corresponds to the doublet that contains the so-called up and down quarks 



u = 



1 1 

2' 2 



, d 



1 1 

2'~2 



(12.39) 



All other isomultiplets, including the proton and the neutron, are made up of these two quarks. 
They can be constructed with the same rules that have been used for angular momentum addition 
in chapter 6. For example, the three pions in (12.38) are ud, uu and du states, respectively. They 
form an isotriplet: 
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(12.40) 

(12.41) 

(12.42) 
(12.43) 



Similarly, the proton and the neutron can be written as totally symmetric uud and udd states. For 
a precise description of the two nucleons as composite states, including the spin and color quantum 
numbers of their constituent quarks, we refer the reader to [1], sec. 2.11. 
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The mass of the up and down quarks are not identical but they are both of the order of a few 
MeV/c 2, s which is minuscule compared to the typical energy scale of hadrons (i.e. strongly inter- 
acting particles) which is about a GeV/c 2 . This is why isospin is such a good symmetry and why 
isomultiplets have nearly identical masses. 

As it later turned out, the up and down quarks are not the only quark 'species' - or flavors as 
they are commonly called. In the late 1940's and early 1950's, a number strange particles have 
been found which presumably contained a third quark species: the strange quark. It shall be noted 
here that the quark model was not invented until 1960's, but at the time the empirical concepts 
like isospin and strangeness quantum numbers were in use. The value of the strangeness quantum 
number is taken, by accidental convention, to be —1 for the strange quark. The up and down 
quarks have strangeness zero. All other composite states have their strangeness given by the sum 
of the strangeness content of their constituents. 

Before proceeding further, we shall setup some terminology: baryons are qqq states, such as proton 
and the neutron, whereas mesons are qq states, the pions being examples thereof. By convention 
baryons have baryon number 1, and quarks have baryon number ^. All antiparticles have their 
quantum numbers reversed. Naturally, mesons have baryon number 0. The names, baryon and 
meson, originally refer to the relative weight of particles, baryons generally are heavy, mesons have 
intermediate mass ranges, where leptons (electron, muon, the neutrinos etc.) are light. If taken 
literally, this remains only an inaccurate naming convention today, as some mesons discovered later 
are heavier than some baryons and so on. 

The relation between electric charge and isospin are given by the Gell-Mann-Nishijima relation 
which was first derived empirically 

Q = h+ l -{B + S), (12.44) 

where B is the baryon number and S is the strangeness. In the next section we will be able to 
view the Gell-Mann-Nishijima relation in the light of the representation theory for the flavor SU(3) 
symmetry. 

Now let us consider another example of combining the isospins of two particles. The reader may 
know that the deuteron, a hydrogen isotope, consists of a proton and a neutron. Therefore it has 
to have isospin, ^3 = 0. We will now try to describe its wave function in terms of its constituent 
nucleons. Following (12.37) and in analogy to (12.43), this will be mathematically identical to 
adding two spins. The possibilities are that of an isosinglet 

|0,0) = ^(|p> \n> -\n> \p>) (12.45) 
v2 

and that of an isotriplet 

|1,1) = \p>\p>, (12.46) 

1 1, 0> = -)={\p > \ n > +\ n >\P>)> (12.47) 
v2 

1 1, — 1> = \n>\n>. (12.48) 

Is the deuteron an isosinglet state or an isotriplet? If it were an isotriplet (|1,0)) we should have 
seen nn and pp bound states of comparable energy in nature (because of isospin symmetry), but 
such states do not exist. Therefore the deuteron has to be an isosinglet state (|0,0)). 
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As an exercise on the implications of isospin symmetry we will consider nucleon-nucleon scattering. 
We will eventually be able to compute ratios of scattering cross-sections between different processes. 
For example, consider 

® P + P ^ d j~ 1T+ o (12.49) 
(II) p + n -> d + vr° y ' 

The only assumption that we put in will be that the interaction is of the form V = a I^ 1 -* • I^ 2 ). The 
dot product here refers to the abstract isospin space. The cross-section, a, is proportional to |.M| 2 , 
where M is the scattering amplitude given by 

M = ( final | al (1) • I (2) | initial ) . (12.50) 

The initial and final states can be denoted in more detail as 

| initial) = I 7 W ), (12.51) 

(final) = I , , ) , (12.52) 

where 7W and 7^ denote degrees of freedom other than isospin, such as the spatial dependence 
of the wave function and spin. 

Exercise. Consider the generalization of tensor operators discussed in section 6.7 to the case of 
isospin. Show that l( 1 )-l( 2 ) is an 'isotensor' of rank zero. Refer to exercise 6.7.5 for the spin-analogue 
of the same problem. 

Exercise. Show that the expectation of 1^ • I^ 2 ) is \ in an isotriplet state and — | in an isosinglet 
state. 

Using (12.38) and the fact that the deuteron is an isosinglet we know that the isospins of the final 
states in (I) and (II) are |1, 1) and |1,0), respectively. According to (12.45) and (12.48) the initial 
states in (I) and (II) have isospin values |1,1) and (l/>/2)(|l,0) + |0,0)). We will now employ the 
(isospin analogue) of the Wigner-Eckart theorem (6.259) discussed in detail in sections 6.7 and 6.8 
to compute the ratio of the scattering amplitudes Mi and M\\. For completeness let us start by 
restating the Wigner-Eckart theorem (6.259) in the present context: 

</(/)/P,7^|T 00 |/»/f , 7 »> = (12.53) 
(/(/)/P|00/«/«) (-1)^-^ 7 =L= (/(/),7(/)||r ||/« 7 «>. (12.54) 

Here T 00 = V = al (1) • I (2) , which is an isoscalar, as discussed in the exercise above. 
(/(/)4 /) |00/«4* ) ) is a Clebsch- Gordon coefficient and </^,7^||To||/W,7W) is a reduced ma- 
trix element defined in the same sense as in section 6.8. 

Now let us re-write more carefully the scattering amplitudes for the two processes in the light of 
what we have just learned 

-Mi = (/^ = l,4 /) = l,7 {/) Too /« = 1,/« = 1,7 W ) (12.55) 
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(11|0011) 4= ( /(/) = !»7 (/) ll T oo|| / W = 1,7 W ; 
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-Mi, - —(/(/) = 1,7^=0, 7 (/) 
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(12.56) 
(12.57) 
(12.58) 

(12.59) 

(12.60) 
(12.61) 



Note that the second term in A^n vanishes due to the isospin conservation, which is also manifested 
by a vanishing Clebsch-Gordon prefactor. The relevant Clebsch-Gordon coefficients are easily- 
evaluated : 

(11|0011) = (10|0010) = 1. (12.62) 
We can now write the ratio of the scattering amplitudes: 

Mi 1 



Mn (1/V2) ' 



(12.63) 



where common dynamical factors (which would not be as easy to compute) have dropped out 
thanks to the Wigner-Eckart theorem. It follows 



^=2, 
on 

which is in approximate agreement with the observed ratio. [2] 



(12.64) 



As a further example, we will consider pion-nucleon scattering. We want to compute the ratio of 
total cross-sections assuming a similar interaction as in the previous example 



The possibilities are 



a(iT + +p — ► anything 
a(ir~ + p — > anything 



(a) 7T + + p — > 7T + + p , 

(b) 7T~ +p — ► 7T~ +p, 

(c) 7T~ + p — > 7T° + n . 



(12.65) 



There are more exotic possibilities, involving, for example, particles with strangeness, but these are 
not dominant at relatively low energies. 

Once again we need the isospins for the initial and final states, which are obtained by a standard 
Clebsch-Gordan expansion 
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(12.66) 
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As in the example of nucleon-meson scattering we define the relevant matrix elements 



Ms = ( §,m I V \ Lm 



2 ' I 12'' 



Ml = < i,m| V | §,m), (12 - 67) 



which are independent of m. A computation similar to the previous example of nucleon-nucleon 
scattering yields (apart from common prefactors) the following amplitudes for the reactions in 
(12.65) 

M a = Ms 

2 

M h = lMs + lMi (12.68) 



M c = ^Ms - #M 



Guided by empirical data we will further assume that Ms » Mi, which leads to the following 

2 2 

ratios for the cross-sections 

u a : a b : a c = 9 : 1 : 2. (12.69) 
As the total cross-section is the sum of individual processes we obtain 

= = 3 (12.70) 

again in approximate agreement with the observed value. [2] 

12.3 The Eightfold Way and the flavor SU(3) symmetry 

The discovery of the concept of strangeness, mentioned in the previous section, was motivated 
by the existence of particles that are produced strongly but decay only weakly. For instance, 
K + , which can be produced by 7r~ + p — > K + + XT, has a lifetime which is comparable to that 
of 7r + albeit being more than three times heavier. Hence Gell-Mann and independently Nishijima 
postulated the existence of a separate quantum number, S, called strangeness, such that S(K + ) = 1, 
S(Y<~) = — 1 and S'(7r) = S(N) = 0, etc. It was assumed that strong interactions conserved S (at 
least approximately), while weak interactions did not. Hence the strangeness changing strong 
decays of K + (or £~) were forbidden, giving it a higher than usual lifetime. 

The classification of the newly found particles as members of some higher multiplet structure 
was less obvious then the case of isospin, however. Strange partners of the familiar nucleons, for 
example, are up to 40% heavier, making an identification of the underlying symmetry and the 
multiplet structure less straightforward. 

In the light of the quark model, it appears an obvious generalization to add another component for 
an extra quark to the isospin vector space 



u= \ ,d= \ 1 ,s= . (12.71) 
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In this case the transformations that 'rotate' the components of (12.71) into each other, while 
preserving the norm, have to be elements of the group SU(3) (which we will investigate closely 
very soon). However, history followed the reverse of this path. First particle multiplets were 
identified as representations of the SU (3) group, the same way nucleons and pions were identified 
as representations of the isospin SU (2) symmetry. Then came the question of what the fundamental 
representation, as in (12.71), should correspond to, giving rise to the quark model. As quarks were 
never directly observed, for a period they were considered as useful bookkeeping devices without 
physical content. 

In this perspective the flavor SU(3) symmetry may appear to be mainly of historical interest. 
However SU(3) symmetry appears in another and much more fundamental context in strong in- 
teraction physics. The quarks posses another quantum number, called color, which again form 
representations of an SU(3) group. This is believed to be an exact symmetry of strong interac- 
tions, in fact modern theory of strong interactions is a 'gauge theory' of this color group, called 
quantum chromodynamics. (The reader is referred to section 8.3 for a brief discussion of gauge 
transformations). Flavor SU(Nf) symmetry on the other hand, where Nf is the number quark 
flavors, becomes increasingly inaccurate for Nf > 3. The reason is that the other known quarks, 
namely charm, bottom (beauty) and top (truth) are significantly heavier than the hadronic energy 
scale. (The 'bare' mass of the charm quark is already heavier than the two nucleons, which set the 
hadronic energy scale. The bottom and top are even heavier. [2] ) 

Before discussing the significance and the physical implications of the quark model, we will establish 
some mathematical preliminaries about the group SU(3). In many respects it will resemble the 
more familiar group SU{2) discussed in some detail in chapter 5, but there are a number of subtle 
differences. The reader shall note that most of what is being said trivially generalizes to other 
unitary groups, SU(N), but we will stick to N = 3 in the following. The reader is also invited to 
revisit section 5.1 whenever necessary, in reference to Lie groups, Lie algebras and related concepts. 

Given a complex vector, a^, of three dimensions, we want to find those transformations 

a k -» U ki ai (12.72) 

that preserve the norm, J2k a *k a ki of a. It is seen that such a matrix U has to satisfy the following 
unitary relation 

U ] = tr 1 . (12.73) 

To verify that all such matrices form a group, we observe that 

(L/V)t = ytf/t = = (UV)- 1 , (12.74) 

for any two unitary matrices U and V. This group of 3 x 3 unitary matrices is denoted by U(3). 
The unitarity relation imposes 9 constraints on the total of 18 real degrees of freedom of a 3 x 3 
complex matrix. Hence the group Z7 (3) has 9 dimensions. Multiplying U by a phase, e 1 ^, still leaves 
the norm invariant. Therefore U(3) can be decomposed into a direct product U(l) x SU(3) where 
SU(3) consists of 3 x 3 unitary matrices of unit determinant. Because of this additional constraint 
SU(3) has 8 dimensions. Since arbitrary phase factors are of no physical interest, it is the group 
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SU(3) and not f7(3) that is of main interest. The reader is invited to compare the structure of 
SU(3) to that of SU{2) discussed in section 5.7. 

As discussed in section 5.1, any unitary matrix, U, can be written in the form 

U = e iH (12.75) 

where if is a hermitian matrix. Therefore we will express elements of SU(3) as 

U = e iY -" akXk (12.76) 

where X k are 8 linearly independent matrices forming the basis of the Lie algebra of SU{3). (We 
shall at times refer to the Lie algebra with the name of the group, the meaning being apparent from 
the context.) The unit determinant condition requires that all are traceless, since det(e A ) = e trA . 
An explicit basis is constructed in analogy to the Pauli algebra of spin operators 



> ^3 



A 4 





(12.77) 



A 8 = 

The generators, obey the following relation 

tr(Aj-Afc) = 2S jk , (12.78) 

which can be verified explicitly as matrix identities. 

The Lie algebra structure is given by the commutators of X k 

[Xj,X k ] = 2if jkl X u (12.79) 

where fj k i are the antisymmetric structure constants similar to the Cj k i of SU{2) given in (5.32). 
We can also introduce the constants, Sjki, via the anticommutator relation 

[A„A fe ]+ = p jk + 26 jM X l , (12.80) 

This is the fundamental or defining representation of SU(3). As in the case of SU{2) higher 
dimensional representations obeying the same structure can be found. The fundamental relation 
to be preserved is (12.79), regardless of the dimension of the representation. 
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As it turns out the set of generators in (12.77) is not the most useful basis in the study of SU(3). 
In the case of SU(2) the identification of 'ladder' operators { J + , J_} proved useful, which satisfied 
an 'eigenvalue equation' 

[Jo, J±] = ±thJ±. (12.81) 

In chapter 5, these relations have been used to construct the angular momentum spectrum as well 
as the function space representation of the rotation algebra, namely spherical harmonics. The 
generators of S77(3) can be arranged into a very similar form to that of SU(2). We first introduce 
the F-spin operators 

Fi = \K (12.82) 

With another change of basis we arrive at the 'standard' form of the generators of the Lie algebra 
of SU(3) 

T± = F 1 ±iF 2 , (12.83) 
T = F 3 , (12.84) 
V± = T 4 ±iF 5 , (12.85) 
U± = F 6 ±iF 7 , (12.86) 

Y = ^=F 8 . (12.87) 



Exercise. Using the convention in (12.71) show that To is the isospin operator, I3. 

Exercise. Derive the Gell-Mann-Nishijima relation (12.44), starting with the observation that 
Y = B + S. (Recall that the strange quark has S = — 1 by convention). Y is called the hypercharge. 

In the basis (12.87) the commutation relations between the generators can be expressed in a succinct 
manner. First, we have 

[Y,T } = 0, (12.88) 
which defines (not uniquely) a maximal set of mutually commuting operators {Y, To} and 

[Y,T±] = 0, (12.89) 
[Y,U±] = ±U±, (12.90) 



W,V±] = ±V±, (12.91) 



[T ,T±] = ±T±, (12.92) 
[T ,U±] = t\u±, (12.93) 

[To,V ± ] = ±±V ± , (12.94) 
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which relate the remaining generators {T±, V±, U±} to this maximal set by 'eigenvalue equations' 
and 

[T+,T_] = 2T , (12.95) 
[U+,U-] = ^Y-T = 2U , (12.96) 

[V+,V-] = ^Y + T = 2V , (12.97) 

which relate commutators of generators with opposite eigenvalues to the maximal set {Y, To}. Note 
that, Uo and Vq are linear combinations of To and Y. Finally, we have 

[T+,V-] = -U-, (12.98) 

[T + ,U+] = V+, (12.99) 

[U+,V-] = T_, (12.100) 

[T+,V+] = 0, (12.101) 

[T+,U-] = 0, (12.102) 

[U+,V+] = 0. (12.103) 

Any remaining commutators follow from hermiticity. 

The same way the angular momentum ladder operators have been used to construct the represen- 
tations of SU(2), we will use these commutation relations to construct representations of SU(3). 
In the case of SU{2) the representations lay on a line on the Jo ax is- However, since there are two 
mutually commuting generators in SU{3) as given in (12.88), the representations will now lie in a 
To — Y-plane. The maximum number of mutually commuting generators of a Lie algebra is called 
its rank. Thus, SU{2) has rank 1, while SU(3) has rank 2. 

When the basis of a Lie algebra is expressed in such a way to satisfy the form of the eigenvalue 
relations as given above, it is said to be in Cartan-Weyl form. This form is essential for easy labeling 
of the representations of the group, as the relation between the states in a given representation 
can be conveniently expressed in terms of ladder operators. A formal definition and a detailed 
discussion of the Cartan-Weyl form is beyond the scope of this chapter. The interested reader is 
instead referred to a very readable account given in chapter 12 of [4] . 

Another important property of SU (2) is the existence of an operator (namely the total angular 
momentum, J 2 ) which commutes with all of the generators. An operator which commutes with all 
generators of a Lie group is called a Casimir operator. As in the case of J 2 and SU(2), Casimir 
operators can be used to label irreducible representations of the Lie algebra, similar to the way it 
was done in section 5.5. We can construct two such independent Casimir operators for the group 
SU(3). 

Ci = E A * = -fE/^W*, (12-104) 

k jkl 

C 2 = Y. d i klX i XkXl (12.105) 

jkl 



In general the number of independent Casimir operators of a Lie group is equal to its rank. 
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The utility of Casimir operators arises from the fact that all states in a given representation assume 
the same value for a Casimir operator. This is because the states in a given representation are 
connected by the action of the generators of the Lie algebra and such generators commute with 
the Casimir operators. This property can be used to label representations in terms of the values 
of the Casimir operators. For example, it was shown in section 5.5 how to label the irreducible 
representations of the angular momentum algebra SU{2) in terms of the value of the total angular 
momentum. 



Now we will construct explicit representations of SU(3). Because of (12.88) we can label states by 
the eigenvalues of Tq and Y operators, |*3,y): 



3b |*3,J/) 
Y\h,y) 



h 1*3, y) , 
y 1*3, y) • 



(12.106) 
(12.107) 



From the commutation relations we have presented above (the Cartan-Weyl form) we can write 
down the effect of various generators on the state |*3,y). For example, we have 



U \t 3 ,y) 

Vb|* 3 ,y) 



3 1 

(^y- 2*3) 1*3, y), 
(|y+2*3)l*3,y)- 



(12.108) 
(12.109) 



The same way that J± \m) is proportional to \m ± 1) in the case of the angular momentum algebra, 
we have 



T±|* 3 ,y) 
U±\t 3 ,y) = P 

V±|*3,y) = 7 



h±-,y±l 



(12.110) 
(12.111) 
(12.112) 



The effect of these operators to the states in the y — *3 plane have been outlined in Fig. (12.1). 
The representations of SU (3) are constructed analogous to those of SU(2) by identifying the 'bound- 
ary' states annihilated by raising (or lowering) operators. All other states of the representation 
are then constructed by successive application of ladder operators T±, U±, V±. The representations 
for hexagons with sides of length p and q in the To — Y-plane. Such a representation is labeled as 
D(p,q) and it has a dimensionality of \{p + l)(q + l)(p + q + 2). Figure (12.2) shows the repre- 
sentation D(2, 1) as an example. The details of this procedure is beyond the scope of this chapter. 
The interested reader is referred to [4], especially chapters 7 and 8. 

As another example for the representations of SU{3), the pion family forms part of an octet corre- 
sponding to the D(l, 1) representation. The representations -D(1,0) and -D(0, 1) correspond to the 
triplets of quarks and antiquaries, respectively. (See Fig. (12.3).) All other representations can be 
constructed by combining these two conjugate representations. For example the pion octet (or any 
other meson octet) is therefore realized as states of a quark - antiquark pair. A notation suggestive 
of the dimensionality of the representation can be used to identify representations. For example, 
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Figure 12.1: The effect of 577(3) ladder operators on the y — ts plane. 




Figure 12.2: D(2, 1) representation of 577(3). The states in the inner triangle are doubly degenerate. 
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D(1,0) and D(0, 1) are denoted by [3] and [3], respectively. The octet D(l, 1) is written as [8] etc. 
This way the quark-antiquark states can be represented as follows 

[3] ® [3] = [8] [1] (12.113) 

The additional singlet state corresponds to the rf meson. This expansion can be compared, for 
example, to the case of adding two spin triplet states, in the case of SU(2), where we would write 

[3] (g> [3] = [1] © [3] © [5]. (12.114) 



y 

A 




Figure 12.3: D(1,0) or the fundamental representation of flavor SU(3) symmetry. 
The three quarks in the fundamental representation can now be written as 



u 



1 1 

2' 3 
1 1 

~2' 3 
2 

0,-o 



The Gell-Mann-Nishijima relation can then be succinctly expressed as 

Q = 2 y + * 3 ' 



(12.115) 
(12.116) 
(12.117) 

(12.118) 
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from which the quark charges follow 



Qu 
Q d = 
Q s = 



2 

3' 
1 

3' 
1 

~3" 



(12.119) 
(12.120) 
(12.121) 
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Figure 12.4: The baryon octet as a D(l, 1) representation of SU(3). 
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